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APPARATUS AND METHOD FOR A RELOCATABLE FILE FORMAT 

5 

LIMITED COPYRIGHT WAIVER 
A portion of the disclosure of this patent document 
contains material to which the claim of copyright 
10 . protection is made. The copyright owner has no 
objection to the facsimile reproduction by any person of 
the patent document or the patent disclosure, as it 
appears in the U.S. Patent and Trademark Office file or 
records, but reserves all other rights whatsoever. 

15 

BACKGROUND 

1. FigJ-d Qt the XpygntiQp 

The invention relates to the loading of software 
into memory for execution by a computer system, and more 
20 particularly, to techniques and file formats which 
facilitate the same. 

2 . Description of Related Art 

Coitqputer programs are typically written originally 
25 in source code in a computer language such as C or 
Pascal, or in an assembly language. To prepare the 
program for execution on a computer system, one or more 
source code modules are passed through a cbn^iler (or 
assembler) which is specific to the language used and 
30 the computer system on which it will be executed, and 
which generates an object code file as its output. A 
linker routine, which is either a separate program or is 
part of the compiler, combines them into a single output 
file, known as an "executable" object code file. One or 
35 more executable s are then loaded together into memory by 
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a loader program, and control is transferred to a start 
address to initiate program execution. 

An executable object code file typically includes, 
among other things, a header section which contains 
5 information about the structure of the file; one or more 
code sections which contains binary instructions which 
are directly executable by the system's CPU; one or more 
data sections; and a loader section, the contents of 
which are described below. 
10 A data section typically contains data which was 

initialized by the compiler in response to the source 
code, descriptors describing various procedure pointers, 
as well as several other types of pointers. The various 
pointers which are contained in the data section may 
15 include some which refer to the address in memory of 
other data objects or of specific computer instructions. 
For example, a pointer may refer to specific objects in 
a code section, such as the entry point of a procedure. 
Other pointers in the data section may contain the 
20 addresses of other objects in the same data section. 
(As used herein, an address may be real or virtual, 
depending on the computer system used) . Further, in 
systems where programs may be compiled into two or more 
executable files and subsequently loaded together, a 
25 data section in one file may contain pointers to objects 
in a code or data section of another file. 

All of these references to absolute addresses must 
be "relocatable" since at the time of coir^ilation and 
linking, the compiler/linker has no way of knowing what 
30 will be the ultimate addresses in memory at which the 
various referenced objects will be loaded. Thus 
references in an executable object code file to an 
address in a code section are often represented in a 
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form which is merely relative to the start of the code 
section, and^ references to an object in a data section 
are represented in a form which is merely relative to 
the starting address of the data section. The loader 
5 program is then able to perform a relocation of these 
references by, after a referenced section is loaded into 
memory and the start address of that section is known, 
merely adding that start address to all of the 
references to objects within that section. 

10 . References to external symbols are typically 

represented in an executable object code file as indices 
into a symbol import/export table which is also 
contained in the file, each import entry in the import/ 
export table identifying both the name of one of the 

15 symbols and the external file which should contain that 
symbol. The indices are often niunbered consecutively, 
and as will be seen below, the table also contains 
export entries. When the loader program encounters a 
reference to an external symbol, it loads the external 

20 file and determines the address of the referenced 
symbol. The loader program then relocates the reference 
by adding in the address of the referenced symbol. 

In order to determine the address of an import 
symbol, the loader searches for the symbol, usually by 

25 name, among the exports in the symbol import/export 
table of the external file. The . entry which contains 
the desired export also designates the address of the 
symbol by designating the section number of the external 
file containing the symbol, and an offset into that 

30 section. 

The loader section of an executable object code file 
typically includes a relocation table containing entries 
which specify how each relocatable reference is to be 
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relocated upon loading into memory. For example, for a 
relocatable -reference to an object which is within a 
code section, the relocation table contains a 
specification that the niimber to be added to the 
5 reference is the start address of the code section, 
rather than the start address of some other section. 
Similarly, for a relocatable reference to an object 
which is contained within a data section, the relocation 
table contains an entry specifying that the number to be 

10 added to the relocatable reference is the start address 
of the data section rather than of a code section. For 
a relocatable reference to an external symbol, the 
relocation table contains a corresponding specification 
of the index to the desired entry in the symbol import 

15 table. 

When the loader program begins operation, it 
retrieves the desired executable object code file from 
a mass storage device such as a disk. If the computer 
system permits multiple tasks to be resident 

20 simultaneously using a shared object code section, then 
a separate copy of the data section (s) is (are) made for 
each task which will use the loaded file. In one 
exanple of memory organization, the loader may first 
check whether the desired file is already present in 

25 memory for another task. If not, the loader loads the 
header, code, data and loader sections of the file into 
a portion of memory which is read-only to individual 
users. In either case, the loader then makes a copy of 
the data section (s) in read/write memory for the new 

30 task. 

If the loader has been invoked to load several files 
or modules into memory at the same time, then these 
files, too, are loaded into memosry in the same manner as 
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the first file. All. the references to external symbols 
are resolved*at this time, by inserting into each file's 
symbol import table the address into which each symbol 
has been loaded. Symbol imports may be resolved 
5 recursively. That is, when one module (e.g. an 
application program) references a symbol in a second 
module (e.g. a library) , the loader may load and perform 
all relocations on the second module before returning to 
resolve the symbol in the first module. Similarly, if 

10 the second module references a symbol in a third module, 
the loader may load and perform all relocations in the 
third module before returning to the second, and so on. 

After the various sections of file have been loaded 
into memory, and imports have been resolved, the loader 

15 performs the relocation process. The relocation process 
is performed by traversing the relocation table in the 
loader section, and performing the specified relocation 
operation for each of the relocatable references 
contained within the current file. 

20 One popular format for executable object code files 

is known as XCOFF. XCOFF is described in the following 
articles published in IBM, "R6000 Inf oExplorer" (CD-ROM, 
1992): 

"a. out File Format", 

25 "Optional Auxiliary Header for the a. out File", 

"Section Headers for the a. out File", 
"Raw Data Sections for the a. out File", 
"Special Data Sections for the a. out File", 
"Relocation Information for the a. out File", 

30 "xcoff.h", 

"filehdr.h", 
"reloc.h" , 
"scnhdr.h", 
" loader. h"; 

35 

all incorporated herein by reference. In XCOFF, the 
loader section has the following portions: 
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10 



15 



Header Portion 

External Symbol In^ort/ Export Table 
Relocation Table 
Import File Ids 
Loader String Table. 

The Header Portion of the Loader Section contains a 
version number and sufficient information to navigate 
the remainder of the Loader Section. 

The External Symbol Import /Export TsUDle contains one 
24 -byte entry for each of the external references for 
the file. The entries are unsorted, with no grouping of 
imports separately from exports, and no grouping of 
imports from one external file separate from inports 
from another external file. Each entry in the External 
Symbol Import/Export Table has the following fields; 



TAB 1 .E X 



20 



25 



Fieiq yram^ 
1 name 



1 value 



1 - scnum 



l_smtype 



l^smclas 
1 ifile 



Length 
Byt:eg 

8 



1 
4 



Description 

Full symbol name (if s 8 bytes) , 
or offset to full symbol name in 
the Loader String Table. 

Offset into 1- scnum where export 
symbol is located; unused if 
import . 

Section number containing the 
export symbol, or 0 if import or 
not defined. 

Symbol type, and flags 
indicating whether the symbol is 
an import, export, and/or entry 
point description. 

Symbol storage class. 

Import file id: ordinal of 
import file Ids in Loader Import 
File Ids portion. Specifies 
source for import symbols. 
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Length 

y^eiq Ngme * Bvtes Description 
l^parm 4 Parameter type check field. 

The Relocation Table contains the relocations 
information for the XCOFF file. Each entry in the 
5 Relocation Table is 12 bytes long and contains the 
following fields: 

TABLE II 

Length 

Fielql Ngmg (Bvtes) Description 

10 l_vaddr 4 Offset within section number 

specified, in l_rsecnm, of an 
information item to be 
relocated. 

l_symndx 4 External symbol import table 

index of object that is being 
referenced. 

l_rtype 2 Type of relocation. 

l_rsecnm 2 Number of the section 

containing the relocatable 
item governed by this table 
entry. 

The l_symndx field of a relocation table entry 
15 specifies whether the item to be relocated is a 
reference to an external symbol, or to an object in one 
of the code or data sections. Specifically, values of 
1 and 2 indicate that the reference is to an object in 
a .data or .bss section respectively (both of which are 
20 considered "data sections" as the term is used herein), 
and a value of 0 indicates that the reference is to an 
object in a .text section (code). Values of 3 or higher 
constitute indices into the external symbol import table 
for the file, and indicate that the relocatable 
25 . reference in the information item is a reference to the 
corresponding external symbol. In this case, the 
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relocatable reference in the information item itself may 
contain 0, or an offset value to which the address of 
the external syxnbol will be added. Note that while 
relocation table entries have the capacity to control 
5 relocations of information items contained in the code 
section, this capacity is rarely used on computer 
systems which support relative branching. For these 
systems, when a con^iler generates a branch instruction 
for the code section, it cypically uses the relative 

10 branch format so as to obviate any need for a 
relocation. When the compiler generates an instruction 
which references a data object, it typically uses an 
Indexed addressing mechanism for which only the offset 
from the base address of the data section need be 

15 included in the ultimately executed code. The software 
pre-loads the starting address of the desired data 
section into a register to use as the base address. 
Further, in the situation where code sections are 
sharable, relocations are avoided in the code section 

20 also because the relocation appropriate for one task may 
not be the same as the relocation appropriate for 
another task sharing the same code section. 

The l_rtype field indicates the type of relocation 
which is to be performed, and most commonly contains a 

25 value indicating that the reference is an cOasolute 
32 -bit reference to the virtual address of the object. 

l_rsecmn indicates the section number containing the 
information item to be relocated. As with the l_symndx 
field, certain predefined values are implicit references 

30 to the .text, .data and .bss sections, respectively. 

The Irrport File Ids portion of the XCOFF file 
contains variable length strings, each carrying the file 
Id for a respective one of the inport files referenced 
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ordinally in the l_if ile field of an import entry in the 
External Symbol Import/Export Table. Similarly, the 
Loader String Table porion of the file contains variable 
length strings, each carrying the text name of a 
5 respective one of the import and export symbols 
referenced in the l_name field of an import or export 
entry in the External Symbol Import/Export Table. 

One problem with the XCOFF file format is that it is 
extremely inefficient in terms of space occupied in the 

10 mass storage device, in terms of memory usage at launch 
time, and in terms of the time required to launch an 
application. The space which an XCOPF file occupies in 
mass storage is in large part due to the fact that XCOFF 
requires 'l2 bytes of relocation information for each 4- 

15 byte word in the data section that requires relocation. 
Thus in an executable object code file containing 1.5 
megabytes, as much as 300k bytes might be occupied by 
the relocation table. The relocation table space 
overhead is also a large factor in the inefficient usage 

20 of memory at launch time. The inefficiency of launch 
time speed performance is due in part to the need to 
retrieve and interpret 12 bytes for every relocation to 
be performed. 

The inefficiency of the XCfOFF file format is due 

25 also in part to the absence of any organization in the 
External Symbol In^ort/Bxport Table. This means that 
when the loader program needs to import a symbol 
exported by an XCOFF file, it must search through the 
entire External Symbol Import/Export Table until it is 

30 found. This can amount to hundreds or thousands of 
entries in some programs, which may need to be searched 
hundreds or thousands of times (once for each in5)ort 
from the file) . Moreover, each search can involve a 
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lengthy string comparison of the symbol name of an 
export in the Table, with the desired symbol import 
name. 

An XCOFF loader program may attempt to speed this 
5 process by reading the entire External Symbol Import/ 
Export Table of an imported XCOFF file into read/write 
memory before performing any comparisons, but this 
itself takes time and requires an extensive amount of 
memory. Additionally, the entire Loader String Table 

10 would also need to be loaded into memory, and in a 
virtual memory computer system there is a strong 
likelihood that the need to repeatedly access the 
External Symbol Irr^ort /Export Table entries and the 
Loader String Table in an alternating manner will cause 

15 repeated page faults and further degrade performance. 
Moreover, a typical importing file may need to import 
only 10% of the symbols exported by the imported file 
being read into memory in this manner. Thus the vast 
majority of the entries in the External Symbol 

20 Import/Export Table and the Loader String Table which 
are brought into memory in order to speed import 
resolution, may never be used. 

In addition to XCOFF, another conventional format 
for executable object code files is used in the GEM disk 

25 operating system for Atari ST computers. See K. Peel, 
"The Concise Atari ST 68000 Programmer's Reference 
Guide" (Glentop Publishers: 1986), especially pp. 2-21 
through 2-24. The entire Peel guide is incorporated 
herein by reference. 

30 In the GEM format, the loader section of an 

executable object code file consists of a series of 
bytes, each of which specifies at most a single 
relocation. A loader routine maintains a pointer into 
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the program being loaded, and updates the pointer in 
dependence *upon each byte in the loader section. 
Specifically, if a byte in the loader section contains 
any number between 2-255 inclusive, the loader routine 
5 adveuices the pointer by the specified number of bytes 
and adds the start program address? to the 32 -bit 
relocatable reference then pointed to by the pointer. 
If the byte in the loader section contains the value 1, 
then the loader routine advances the pointer by 254 

10 bytes without performing a relocation, A zero byte in 
the loader section indicates the end of relocations. 

The GEM executable object code file format and 
loader routine are extremely primitive, lacking any 
capability for symbol imports and exports, for separate 

15 code and data sections, or for any kind of relocation 
other than the addition of the start program address to 
a 32 -bit relocatable reference. Additionally, like the 
XCOFF format, the GEM format still contains a relocation 
table entry (byte) for each relocation to be performed. 

20 

SUMMARY O F THE INVENTION 
The invention takes advantage of certain 
characteristics of executable object code files to 
drastically reduce the number of bytes of relocation 

25 information which are required per relocation. In 
particular, roughly described, relocation table entries 
in an executable object code file are interpreted as 
relocation instructions rather than individual 
specifications for a particular respective relocatable 

30 information item. An abstract machine is provided for 
interpreting the relocation instructions and performing 
various relocation operations and various control 
functions for the abstract machine, in response to the 
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relocation instructions. The abstract machine maintains 
certain variables containing information which is 
referenced and updated in response to certain types of 
the relocation instructions, thereby obviating the need 
5 to include such information as part of each relocation 
instruction. Certain of the relocation instruction 
types can also specify a particular relocation operation 
to be performed on a run of n consecutive relocatable 
information items, where n is specified as part of the 

10 relocation instruction. Other types of relocation 
instructions are also made available. 

In another aspect of the invention, an executable 
object code file separates the in^ort symbol table from 
the esqport symbol tedDle, and in a combined string table, 

15 locates the import symbol name strings closer to the 
import symbol table and locates the export symbol name 
strings closer to the export symbol table. The export 
symbol table is also sorted according to a hashing 
algorithm. Additionally, pattern initialized data 

20 sections are stored as data expansion instructions to 
further reduce the size of the object file. 

BRIEF DESCRIP TION OF THE DRAWINGS 
The invention will be described with respect to 
25 particular embodiments thereof, and references will be 
made to the drawings, in which like elements are given 
like designations, and in which: 

Fig. 1 is a symbolic, simplified block diagram of a 
computer system incorporating the invention; 
30 Fig. 2 is a simplified flow chart illustrating the 

broad functions of a loader program using the present 
invention; 
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Fig. 3 is a symbolic diagram of information storage 

in. the memory of Fig. 1; 

Fig. 4 is a flow chart detailing the RESOLVE IMPORTS 

step of Fig. 2; 
5 Fig. 5 is a flow chart detailing the PERFORM 

RELOCATIONS step of Fig. 2; 

Fig. 6 illustrates the structure of a data section 

generated by a typical compilers- 
Figs. 7A-7G illustrate field definitions for 
10 relocation instructions recognized by the abstract 

machines- 
Fig. 8 is a flowchart detailing the FIND NEXT IMPORT 

SYMBOL NAME Step in Fig. 4; 

Fig. 9 is a simplified flowchart illustrating the 
15 broad functions of a program for constructing a file 

according to the present invention ; 

Fig. 10 is a flowchart detailing the CONVERT SYMBOLS 

step of Fig. 9; 

Fig. 11 is a flowchart detailing the SORT PEF EXPORT 
20 SYMBOLS step of Fig. 9; 

Fig. 12 is a flowchart detailing the CONVERT 

RELOCATIONS TO PEF RELOCATION INSTRUCTIONS step of 

Fig. 9; 

Fig. 13 is a flowchart detailing the DO SYMR, SYMB, 
25 LSYM INSTRUCTIONS step of Fig. 12; 

Fig. 14 is a flowchart detailing the DO DDATA, CODE, 
DATA INSTRUCTIONS step of Fig. 12; and 

Fig. 15 is a flowchart detailing the DO DESC 
INSTRUCTIONS step of Fig. 12. 

30 

DETAILED DESCRIPTION 

Fig. 1 is a symbolic block diagram of a computer 
system in which the invention is implemented. it 
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conprises a CPU 102 which is coupled to a bus 104, The 
bus 104 is also coupled to a memory 106 and a disk drive 
108. Numerous types of computer systems may be used, 
but in the present embodiment, coirputer software is 
5 usually stored on the disk drive 108 and brought into 
memory 106 over the bus 104 in response to commands 
issued over bus 104 by the CPU 102. The CPU 102 runs an 
operating system which, possibly together with 
specialized hardware, supports virtual addressing. That 

10 is, a computer program running on CPU 102 can reference 
memory addresses in an address space which is much 
larger than the physical space in memory 106, with the 
operating system and/or specialized hardware swapping 
pages from memory 106 to and from the disk drive 108 as 

15 necessary. The operating system also supports multi- 
tasking, in which two or more tasks can execute 
different copies of the same or different software in a 
parallel or time -sliced manner. 

Executable object code files stored on the disk 

20 drive 108 are referred to herein as containers, and 
those that incorporate features of the presently 
described embodiment are referred to as PEF containers 
(that is, they follow a "PEF" format) . Such files may 
also be retrieved from a different part of memory 106, 

25 or from another resource (not shown) . A PEF container 
may contain the compiler output from a single source 
code module or, if the compiler is able to combine more 
than one source code module, the output from a plurality 
of source code modules. Several different containers 

30 (PEP or otherwise) may be required to form a complete 
program. 

OVERALL PEP CONTAINER STRUCTURE 
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Each PEF container contains the following portions; 
a container header, N section headers, a string table, 
and N section data portions. The container header is of 
a fixed size and contains global information about the 
5 structure of the PEF container. The exact contents of 
the container header are \inimportant for an 
understanding of the present invention and are therefore 
not described herein. 

The section headers portion of a PEF container 

10 includes a section header for each of N "sections". 
Every PEP section, regardless of whether it includes any- 
raw data, has a corresponding section header. Such 
sections include, for exaxr^le, loadable sections such as 
code, data and pattern initialized data (pidata) 

15 sections, and nonloadable sections such as a "loader" 
section. Each of the sections has a corresponding 
section number which is derived from the order of the 
section headers in the section header portion of the PEF 
container, starting with section 0. Section 0 is 

20 typically a code section and section l is typically a 
data section, but this is not a requirement. 

Each section header is 28 bytes long and has the 
following structure: 

TABLE III 

25 No. of 

Bytes Description 

4 Section name (offset into global string 
table) . 

4 Desired section address in memory (typically 
0) . 



4 Execution size in bytes. 

Initialization siz< 
zero- initialized e: 

Raw size in bytes. 



30 4 Initialization size in bytes (prior to any 

zero- initialized extension) . 
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4 Offset from beginning of container to this 
section's raw data. 

1 Region kind (code section, data section, 
pidata section, loader section, etc.). 

1 Memory alignment required for this section 
(byte alignment, half-word alignment, full- 
word alignment, etc.). 

1 Sharing kind (unsharable section, sharable 

within a context; sharable across all context 
within a task; sharable across all contexts 
within a team; or sharable across all 
contexts . 

5 1 Reserved . 

The exact meaning of each of the elements in a 
section header is unnecessary for an understanding of 
the present invention, except as included in the table 
10 above, and except to note that the section kinds include 
the' following sections: 

TABLE IV 
Mfifl Description 

code A code section is loadable and contains 

read-only executable object code in binary 
format, in an uncompressed, directly 
executable format. 

20 

data A data section is loadable and contains 

initialized read/write data followed by 
read/write zero- initialized data. The 
presence of zero- initialized read/write 

2^ data is indicated by the section size 

being larger than the image size. A PEF 
data section combines the .data and .bss 
sections of a standard UNIX executable 
object code file. Data sections are not 

30 compressed. 

pidata A Pidata section is loadable and 
identifies a read/ write data region 
initialized by a pattern specification 
3S contained in the section's raw data in the 



wo 94/22978 



PCTAJS94/029S2 



- 17 - 



10 



15 



20 



25 



object file. In other words, the raw data 
^ portion of a PEF container which 
corresponds to a pidata header contains a 
small program that is interpreted by the 
loader to determine how the section of 
memory should be initialized. Zero 
extension is automatically handled by the 
loader for pattern- initialized data, as 
with the data section. The format of a 
pidata section is described below. 

constant A constant section . is loadable and 
contains read-only data. The loader can 
place it in read-only memory, and a 
constant section is not compressed. 

loader The loader section is nonloadable and 
contains information about imports, 
exports and entry points. The format of 
the loader section is discussed below. 

debug The debug section is nonloadable and 
contains all information necessary for 
symbolic debugging of the container. 
Multiple formats can be supported, 
including the format of a conventional 
XCOFF .debug section. 



30 



35 



40 



The PEF container format includes no region kind 
specifically for zero- initialized data, similar to a 
.bss section in a conventional XCOFF file. Rather, 
zero- initialized sections are achieved in PEF simply by 
specifying a data section with an init size of 0. The 
loader uses the exec size to determine how much space to 
allocate to the section in memory, and then uses the 
difference between the exec size and the init size to 
determine how much space to zero- initialize. Thus the 
effect of a data section with a 0 init size is to zero- 
initialize the entire data section in memory. Note 
there is no requirement that zero- initialized data 
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sections immediately follow other data sections in 
memory . 

The section data portion of the PEF container 
contains raw section data corresponding to each of the 
5 headers in the section header portion of the PEP 
container (except for zero- initialized sections). The 
sequence of section data portions need not be the same 
as the sequence of their corresponding section headers. 
The raw data within certain sections including code, 
10 data and loader sections are required to be 8-byte 
(double -word) aligned. For code sections, the raw data 
simply contains the executable object code. For data 
sections, the raw data contains the initialized data for 
the section. For the loader section, the raw data 
15 contains additional information needed by the loader as 
set forth in more detail below. 
Loader Section Format 

The loader section contains the following portions: 
Loader section header 
20 Table of Import Container ID's 

Import Symbol Table 
Relocation headers 
Relocation instructions 
Loader String Table 
25 Export Slot Table 

Export Chain Table 
Export Symbol Table. 
Loader Section Header . The loader section header 
has a fixed size, and provides information about the 
30 structure and relative location of the various 
components of the loader section. Most of the entries 
in the loader section header are unimportant for an 
understanding of the present invention, except to note 
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that it contains, among other things, one entry 
specifying the number of entries present in the Table- of 
Insert Container ID's, one entry indicating the nximber 
of entries in the "irt^ort syinbol table" portion of the 
loader section, one entry indicating the number of 
sections in the PEF container which require relocations, 
one entry indicating the byte offset from the start of 
the loader section to the start of the relocation 
instructions {Relocation Table) , one entry indicating an 
offset to the Loader String Table, one entry indicating 
an offset to the Export Slot Table, a "hash slot count" 
(indicating the number of "slots" in the Export Slot 
Table), and an entry indicating the number of symbols 
which are exported from this object file (including re- 
15 exports of in^orts) . All tables use zero-based indices. 

Tabl^ of Import Container ID' a . The Table of Irt^ort 
Container ID's contains an entry for each external 
container which may contain a symbol to be imported into 
the present PEF container. Such external containers may 
20 follow a conventional format such as XCOFF, or 
preferably follow the same format as described herein 
for a PEF container. Each entry in the Table of Import 
Container ID's contains, among other information which 
is not in^ortant for an understanding of the present 
25 invention, the following fields: an offset into the 
Loader String Table to indicate the name of the 
container, a count of a number of symbols which are 
contained in the Import Symbol Table for the particular 
external container, and a zero-based index of the first 
entry in the Import Symbol Table (in the present file) 
for the particular external container. 

Jmport Symbol Tablfi . The Import Symbol Table 
contains one entry for each external symbol to be 



30 
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imported. Each Inqport Syiribol Table entry contains a 
symbol class * identifier together with an offset into the 
Loader String Table to indicate the name of the symbol. 
All of the symbols to be imported from a given one of 
5 the external containers are grouped together in the 
Import Symbol Table, although within that group they may 
appear in any order. Imported symbols may be in any of 
the following classes: an address in code, an address in 
data, a pointer to a transition vector (also referred to 
10 herein as a descriptor or procedure pointer) , a TOC 
symbol, or a linker inserted glue symbol. 

Relocation Headers. The Relocation Headers portion 
of the loader section of a PEF container contains one 
entry for each loadable section of the PEF container 
15 which includes one or more information items to be 
relocated. As used herein, an "information item" may be 
a word, a double word, or any other unit of information 
which contains one or more relocatable address fields. 
Unlike the conventional file format, in which each 
20 relocatable information item consisted of exactly one 
relocatable address field, a relocatable information 
item in PEF may contain any number of relocatable 
address fields as defined by the relocation instruction. 
Note that although relocations of information items 
25 which are contained within a code section are supported 
by the PEF container definition, code sections 
preferably do not include any load time relocations at 
all. This permits code sections to be located outside 
of user-writable memory, and also permits code sections 
30 to be sharable- Note also that though multiple data 
sections are supported, typically only one data section 
will exist. 



wo 94/22078 



PCTAJS54/02952 



- 21 - 

Each header in the Relocation Headers portion of the 
loader section of a PEF container has the following 
format: 

TABLE V 

5 No. of 

Bytes Description 

4 Section number. 

10 4 Number of bytes of relocation instructions 

for this section. 

4 Offset from .the start of the Relocation 

Table to the first relocation instruction 
15 for this section. 

The section number field of a relocation header 
identifies the section niomber of a loadable section 
whose information items are to be relocated. Section 

20 number 0 may, for example, be a code section, and 
section number 1 may, for example, be a data section. 
As set forth above, the section numbers are assigned 
depending on their sequence in the "Section Headers" 
portion of the PEF container. Also as set forth above, 

25 no relocation header will typically be included for a 
code section since code sections preferably contain no 
relocatable information items. Additionally, section 
numbers -1 and -2 have special meanings which are not 
here relevant; these sections do not contain 

30 relocations. Section number -3 indicates a re -export - 
of-inqport, and does not contain any relocations. 

Relocation Table. The Relocation Table in the 
loader section of a PEF container consists of an array 
of relocation instructions. All of the relocation 

35 instructions for information items within a given 
section of the PEF container are grouped together. The 
relocation instructions are two or four bytes in length 
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and contain an opcode in the high- order bits. The 
remaining fields of the instruction depend upon the 
opcode and are described hereinafter. 

Loader String Table. The loader string table 
5 contains all of the strings referenced by other portions 
of the loader section. The strings are all variable in 
length, and include both import and export symbol names 
and file names of imported libraries. The loader string 
table is located in the middle of the loader section, 

10 after the import symbol table and relocation table, but 
before the symbol export portions of the loader section. 
All of the import symbols are grouped together and 
located nearest the iirqport symbol table, and all of the 
export symbols are grouped together and located nearest 

15 the export symbol table. Both the grouping of the 
symbols and their locations relative to the two symbol 
tables tend to improve performance on loading in a 
virtual memory computer system, since page hits are more 
likely. 

20 Symbol Export Portions . The PEF container 

organization hashes a container's export symbols into a 
hash data structure at container build time. This 
considerably speeds the search for exported symbols when 
they are to be iit^orted into another container at load 

25 time. The symbol export portions of the loader section 
of a PEF container include an Export Slot Table, an 
Export Chain Table, and an Export Symbol Table. 

In the hash data structure, the Export Slot Table 
contains an entry for each hash "slot" in the data 

30 structure. Each entry contains a chain count indicating 
the number of export symbols which have been placed in 
the slot corresponding to the entry's index, and an 
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index into the Export Chain Table of the first Export 
Chain Table entry for the slot. 

The Export Chain Table contains an entry for each 
export symbol, grouped according to their slots. Each 
5 entry contains the hashword value for the entry's export 
symbol . 

The Export Symbol Table contains an entry for each 
symbol in the current PEF container which may be 
imported by another container. These entries include an 

10 offset into the Loader String Table of the current 
container to identify the name of the symbol, the number 
of the section in the current PEF container which 
contains the symbol, and the offset into that section at 
which the symbol may be found. 

15 Thus to locate an export symbol given its name, the 

loader program first computes the hashword value for the 
symbol name using a predefined Name- to -Hash -Word 
function. It then uses a predefined Hash-Word- to-Hash- 
Slot -Number function, together with the size of the 

20 Export Slot Table in the given container (obtained form 
the Loader Section header) to compute the slot n\unber 
for the export symbol. It uses the hash slot number as 
an index into the Export Slot Table, from where it 
fetches the chain count and the Export Chain Table index 

25 of the first Export Chain Table entry for the slot. 
While the count is valid, the Export Chain Table is 
searched starting at the Export Chain Table index 
previously fetched. When a hashword matches that 
computed for the desired export symbol, the Export Chain 

30 Table index of that hashword designates the export index 
number of a potential name match. To see if the name 
really matches, the loader program applies the export 
index to the Export Symbol Table, which contains the 
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offset to the export name's text, which is compared to 
the desired -export symbol name. If it matches, then the 
information eibout the export identified by that index is 
returned. Otherwise, as long as the count is still 
5 valid (if not, symbol is not found) , the next export 
index is checked. 

A PEF loader program can also locate an export 
symbol given its export symbol index. To find 
information about an export symbol by (zero based) 

10 index, the PEF Loader applies the index to the Export 
Chain Table to fetch the hashword value for that symbol, 
or applies the index to the Export Symbol Table to fetch 
the rest of the information about the value. The Esqport 
Symbol Table contains a pointer to the actual name text. 

15 The general process for construction of the Esqport 

Slot Table and the Export Chain Table in the presently 

described embodiment is as follows: 

1. Compute the number of elements in the Export 
Slot TcJjle. This is based upon the number of 
20 exports, but can vary depending on desired 

Export Slot Table size overhead. Write the 
Export Slot Table size into the Loader Section 
header . 

25 2. For each export, compute its hashword value 

using the Name -to- Hash -Word function. Then 
compute its export hash slot value based upon 
the Export Slot Table size, using the Hash- Word- 
to -Hash- Slot- Niunber function. 

30 

3. Sort the exports by hash slot number. This 
order groups together all exports with the same 
slot number, and creates the (zero based) symbol 
index for each export. At each index in the 

35 Export Chain Table, write, the hash word for the 

export. At each index in the Export Symbol 
Table, write the symbol table entry. 

4. Construct the Export Slot Table (size given by 
40 step 1) . Each entry in this table has the count 
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Of exports, as well as the index of the first 
export whose hash slot values collide at that 
hash slot index. 

5 The following is a preferred method for computing an 

Export Slot Table size for use in the hashing process, 
although other methods could be used instead. The hash 
slot table size function computes an appropriate size 
for the Export Slot Table, given the number of exports. 

10 The size is always a power of 2, and therefore the 
output of this function is to give the number of the 
power of 2. The constant kAvdChainSize is normally 5, 
but can vary as desired for a size vs. speed trade off. 
Note: there is a very small penalty for longer hash slot 

15 chains, which result from shorter hash slot tables, 
because the data structures are optimized to account for 
hash slot collisions, 
int NumSlotBits ( long escportCount ) 

20 register int i; 

for (i s 0; i < 13; i++) { 

if ( ea^ortCount / (1 « i) < kAvgChainSiae ) break; 

25 

if (i < 10) return i+i; 
return i; 
} /* NumSlotBits () */ 

30 The preferred Name- to-Hash-Word function used to 

compute the hashword for export symbols in the loader 
section, encodes the length of the identifier as the 
high order 16 bits, and the low order bits are 
constructed by accumulating for each byte the following 

35 operation: rotate the current hash value 1 bit, and xor - 
in the next byte of the name. It is implemented as 
follows : 

itdefine ROTL(x) ( ( (x) « 1 ) - ( (x) » (16) ) ) 

40 /* produce hash value for id given length (length == 0 --> mill 
terminated) ♦/ 
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unBigned long Hash (register unsigned char *name, register int 
length) 

register long hash s 0; 
5 register int len s O; 

while (*name) { 

hash « ROTL { hash ) ; 
hash *s *naine-i-+; 
10 len-i-f; 

if (-^length » o) break; 

return (unsigned short) (hash (hash >> 16)) * (len « 16); 
} /* Hash 0 */ 

15 

The Hash-Word- to-Hash-Slot-Nvunber function converts 

a hash value as computed above, into an appropriately 

small index number. The preferred slot function is 

based upon the size of the Export Slot Table, also 

20 confuted as above. 

#define HashSlot (h,S,M) ( ( (h) * ( (h) » (S) ) ) & (M) ) 
hashSlotlndex = HashSlot ( haBhWord32, htShift, (1 « htShift) - 1 ) 

Pidata Section Format 

25 The raw data of a pidata section is organized as a 

sequence of data expansion instructions. Each 
instruction begins with a 3 -bit opcode followed by a 5- 
bit count, together in the first byte. Then depending 
on the opcode, additional counts and/or raw data bytes 

30 may follow. The opcodes are as follows: 

Opcode 0: Zero #Couiit 

This opcode takes one count parameter. It clears Coma bytes starting at the current data 
3 5 location. 

Opcode I : Block #Omat {Count if of raw data b>tes) 

This opcode takes one count parameter, and count number of raw data bytes. It deposits 
40 the raw data bytes into the data location, essentially performing a block byte transfer. 

Opcode 2: Repeat ^Countl, mCount 2 (CountJ # of raw data bytes) 
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This opcode takes two count parameters, and Coimtl number of raw data bytes. It 
repeats the CountI number of raw data bytes RG7unr2 times. 

Opcode 3: RepeatBIock #Count 1, #DCount2, #RCount3 (raw data bytes) 

This opcode takes three count parametres and CowUl + DComt2 * RCountS number 
of raw data bytes. The raw data bytes are interpreted as follows: the first Coimtl bytes 
make up the "repeating pattern". The next DComt bytes make up the first "non- 
repeating part" . There are RCountS number of "non-repeating part"s. The opcode first 
10 places the "repeating pattern", then one "non-repeating part". TTiis is performed 
RCoum3 times, each time using the same "repeating pattern" and a new "non-repeating 
'.part". Lastly, one more "repeating pattern" is placed. 

Opcode 4: Repeat Zero MCountl, 0DCount2, RCountS (raw data bytes) 

This opcode takes three count parameters and DCowu2 * RCountS number of raw data 
bytes. The raw data bytes are interpreted as follows: The "rq)eating pattern" is made 
up of Couml bytes that are not represented in the raw data bytes. The first DComt 
bytes make up the first "non-repeating part". There are RCoumS number of ••non- 
20 repeating part"s. The opcode first places the "repeating pattern" (zeroes), then one 
"non-repeating part". This is performed RCountS times, each time using the same 
"repeating pattern" (zeroes) and a new "non-repeating part". Lastly, one more 
"repeating pattern" (zeroes) is placed. 

25 

LOADER PROGRAM 

Fig. 2 is a simplified flow chart illustrating broad 
functions of a loader program which may employ the 
present invention. The LOAD CONTAINER routine 202 
30 illustrated in the figure is recursive as hereinafter 
described. 

In a step 204, the entire PEF container then being 
loaded is brought from the source of the container (disk 
drive 108, another portion of memory 106, or a resource 
35 (not shown) ) into a portion of memory 106 which is not 
writable by a user program. If the desired container is 
already present in memory 106, having been previously 
loaded by another task, then this step 204 is skipped. 
Fig. 3 is a symbolic diagram of memory array 106 for 
40 an illustration of the LOAD CONTAINER routine for a 
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situation in which the primary PEF container is an 
application ' program, and in which the application 
program includes calls to external symbols which are all 
located within a second PEF container referred to herein 
5 as a library container. As shown in Fig. 3, the 
application program is loaded in step 204 into a region 
302 of the read-only portion of memory 106. The 
application program includes (among other sections not 
shown) a header section loaded into a memory region 304, 

10 a code section loaded into a memory region 306, a data 
section loaded into a memory region 3 08, and a loader 
section loaded into a memory region 310. 

Referring again to Fig. 2, after the container is 
loaded into memory 106, the LOAD CONTAINER routine 

15 allocates and fills read/write memory regions for all 
the data and pidata sections (step 206) . For the 
illustration of Fig. 3, this step involves copying the 
data section from region 308 of memory 106 into a region 
312 in a read/write part of memory 106. The regions to 

20 be allocated in this step are identified in the header 
section 304 of the application program, and the data to 
be copied or otherwise written into the allocated memory 
regions derive from the corresponding section data 
portions of the application program, 

25 Also in step 206, a regions [i] • table is created to 

indicate the virtual start address for each region i 
created for the present container. Specifically, if the 
present container includes only one code section, and it 
is identified in the Section Headers portion of the 

30 container as a section 0, then regions [0] will carry the 
virtual address in read-only memory at which the code 
section was placed. In the example of Fig. 3, 
regions [0] will carry the virtual address of the start 
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of memory region 306. Similarly, if section 1 of the 
container ife a data section, then the table entry 
regions [1] will carry the virtual address of the data 
section in memory 106 as copied for use by the current 
5 task. In the eacample of Fig. 3, the table entry for 
regions [1] would carry the virtual address of memory 106 
region 312. As mentioned, multiple code and data 
sections are supported and may occur in any order. Each 
is given a number, and the regions [i] table indicates 

10 where each section is located in memory 106. 

Referring again to Fig. 2, after the regions [i] 
table is created, all references to external symbols are 
resolved in a step 208 and in a manner hereinafter 
described. (Note that in the present embodiment, though 

15 not important for an understanding of the present 
invention, certain external symbol imports need not be 
resolved at this time if they carry a flag indicating 
that the import need not be resolved.) The result of 
the RESOLVE IMPORTS step 208 is an imports [j] table 

20 which carries the virtual address of each of the symbols 
j which are referenced in the present container. The 
RESOLVE IMPORTS step 208 is described in more detail 
below. 

After the imports are resolved, a decision is made 
25 in a step 212 as to whether any relocation headers are 
present in the Relocation Headers portion of the loader 
section of the PEF container. If not, then the user 
program is now fully loaded into memory 106 and the LOAD 
CONTAINER routine 202 is complete. The user program may 
30 be executed at this time. If the Relocation Headers 
portion of the loader section does contain one or more 
relocation headers, then the loader executes a PERFORM 
RELOCATIONS step 210 on the section indicated by the 
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next relocation header. As mentioned above, there is no 
requirement 'that the sequence in which sections are 
relocated be the same as the sequence of the sections in 
the PEF container, nor is there any requirement that 
5 sections be relocated in numerical order. After the 
relocations are performed on all the information items 
within the section identified in the current relocation 
header, the index into the relocation headers is 
incremented in a step 216, and the routine returns to 

10 the decision step 212 in order to determine whether any 
further Relocation Headers remain in the Relocation 
Headers portion of the loader section of the current PEF 
container. This process continues until all of such 
sections have been processed. 

15 Symbol Import Resolutinn 

Fig. 4 is a flow chart detailing the RESOLVE IMPORTS 
step 208 (Fig. 2). Generally described, the RESOLVE 
IMPORTS routine 208 loops through all of the Import 
Container ID's in the Table of Import Container ID'S in 

20 the Loader section header of the current PEF container. 
For each such Import Container ID, the RESOLVE IMPORTS 
routine 208 recursively calls the LOAD CONTAINER routine 
to bring the referenced container into memory and to 
recursively resolve the inserts specified therein. Once 

25 control returns to the RESOLVE IMPORTS routine for the 
present container (referred to herein as the "importing 
container"), the routine cycles through all of the 
entries in the Import Symbol Table in the loader section 
of the importing container. For each such entry, the 

30 routine compares the name of the imported symbol to 
various ones of the strings identified in the Export 
Symbol Table of the container most recently loaded 
(referred to herein as the "imported container") until 
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the hashed 6rganization of PEF export symbol tables. 
The virtual address of the imported symbol is then 
calculated from the corresponding entry of the Esqport 
5 Symbol Table of the inported container and stored in the 
in?)orts[j] table. 

Accordingly, the RESOLVE IMPORTS routine 208 begins 
with a step 402 in which it is determined whether any 
entries remain in the Table of Import Container ID'S for 

10 the present container. If not, then the RESOLVE IMPORTS 
routine 208 is finished and the routine is exited in a 
step 404. If entries do remain in the Table of Import 
Container ID'S, the LOAD CONTAINER routine is 
recursively invoked, in a step 406, for the next 

15 referenced container. As previously mentioned, 
containers need not all follow the PEF format, and if 
the next specified container is not in the PEF format, 
then another appropriate routine is invoked instead of 
LOAD CONTAINER. 

20 After control is returned to the RESOLVE IMPORTS 

routine 208 for the present container, a determination 
is made whether the Loader Import Symbol Table in the 
loader section of the present container includes any 
further symbols to be imported from the container most 

25 recently loaded (step 408) . If not, then control is 
returned to step 402 to determine whether the present 
container requires loading of any further external 
containers . 

In the illustration of Fig. 3, recursive execution 
30 of the LOAD CONTAINER routine 202 results in the loading 
of the library container into a region 314 of the read- 
only part of memory 106. The container header of the 
library container is placed in a region 316, the code 
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section is placed in a region 318, the data section is 
placed in a 'region 320, and the loader section for the 
library container is placed in a region 322. The data 
section 320 is also copied to region 324 in the 
5 read/write part of memory 106 for the present task.. A 
separate regions [i] table is created for the library 
container and used as hereinafter described to perform 
relocations in the library container and, after 
processing resumes for the application program, to 

10 calculate the virtual addresses of symbols imported from 
the library container. 

Returning to the flow chart of Fig. 4, after it is 
determined that additional symbols remain to be inported 
from the most recently loaded container, the RESOLVE 

15 IMPORTS routine 208 finds the next iirqport symbol name 
(from the Import Symbol Table of the present container) 
in the Export Symbol Table of the container most 
recently loaded (step 410) . This is accomplished by the 
steps illustrated in Fig. 8. 

20 Referring to Fig. 8, the step 410 begins by 

computing the hashword value for the desired import 
symbol name using the predefined Name- to -Hash -Word 
function (step 802) . In a step 804, the routine uses 
the predefined Hash-Word-To- Hash -Slot -Number function, 

25 together with the size of the Export Slot Table, to 
compute the export slot number where the import symbol 
should be found. In step 806, the routine uses the hash 
slot number as an index into the Export Slot Table, from 
where it fetches the chain count and the Export Chain 

30 Table index of the first Export Chain Table entry for 
the slot. In a step 808, the routine loops through the 
Export Chain Table, starting from the index previously 
fetched, for up to the number of entries designated by 



wo 94/22078 



PCT/US94/02952 



- 33 - 

the chain count. For each entry in the Export Chain 
Table, the routine determines whether the hashword for 
that entry matches that computed for the desired import 
symbol (step 810) . If not, then the loop continues 
5 (808) . If so, then the Export Chain Table index of that 
hashword is applied Export Symbol Table to obtain the 
offset to the export name's text in the Loader String 
Table (step 812), and it is determined whether the 
corresponding Loader String Table entry matches the 
10 desired import symbol name (step 814) . if not, then the 
loop continues (step 808), and if so, then the import 
has been found in the export symbol table and the 
routine returns to the caller (step 816) . 

Thus the completion of step 410 results in an index 
15 to the referenced entry in the Export Symbol Table of 
the most recently loaded container. In a step 412, the 
information -in that Export Symbol Table entry is used to 
calculate the virtual address of the symbol to be 
imported, which is then stored in the imports tj] table 
20 for the importing container. In particular, the virtual 
address of the imported symbol is calculated as the 
virtual address of the section in the imported container 
which contains the in^orted symbol, plus the offset into 
that section where the symbol is located. The virtual 
25 address of the section in the iit^orted container which 
contains the imported symbol is taken from the 
regions [i] table for the imported container, and both i 
and the offset are taken from the imported container's 
Export Symbol Table entry which has been identified for 
30 the symbol to .be imported. The symbol number j is the 
symbol number as seen by the present, or importing 
container . 
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Note that if the section number in the Export Symbol 
Table entry for the desired symbol contains the special 
section nuxriber -2, then the virtual address is taken 
directly from the offset field of that entry. If the 
5 section number is -3 (indicating a re-export of a symbol 
imported into the library container from yet a third 
container) , then the offset is used as an index into the 
library's import symbol table. 

After the virtual address of the present symbol is 

10 calculated and stored in imports [j], j is incremented in 
a step 414 and control is returned to the decision step 
408 to determine whether any further symbols need to be 
imported from the most recently loaded container. 
Relocating References 

15 Pig. 5 is a flow chart detailing the PERFORM 

RELOCATIONS step 210 in Fig. 2. As mentioned, the 
PERFORM RELOCATIONS routine is invoked for each of the 
loadable sections i in the current PEF container, in the 
sequence specified in sequential relocation headers in 

20 the current PEF container. The current section number 
i . is passed as a parameter to PERFORM RELOCATIONS 
routine. A C- language procedure which includes the 
PERFORM RELOCATIONS routine 210 is set out in 
Appendix A. 

25 Referring to Fig. 5, the routine first checks for 

errors in a step 502, and aborts if an error is 
detected. Such errors can include the receipt of a 
section number which is less than zero or greater than 
the total number of sections in the current container. 

30 Another error might be that the section number provided 
by the calling routine is a constant section or the 
loader section, neither of which contain any relocatable 
items . 
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After the initial error checking step 502, several 
variables are initialized in a step 504. As will be 
seen, the individual elements of the Relocation Table 
are interpreted at a nwch higher level in the present 
5 embodiment than would relocation table entries in a 
conventional relocation table. This interpretation is 
performed in the present embodiment entirely in 
software, by routines which simulate decoding and 
execution of the relocation instructions in the 
10 Relocation Table. Thus the interpretation software 
forms a "pseudo machine" or "abstract machine" for 
interpreting and executing the relocation instructions. 

Accordingly, the following abstract machine 
variables are initialized in the step 504: 

15 reloc Pointer to the next relocation instruction 

in the Relocation Table of the loader 
section of the current PEF container. 
Initialized to point to the first 
relocation instruction for the current 

20 section number as passed to the PERFORM 

RELOCATIONS routine 210. 

rlEnd Pointer to the first byte after the last 
relocation instruction for the current 

25 section number as passed to the PERFORM 

RELOCATIONS routine 210. Initialized to 
that value by adding, to reloc, the number 
of bytes of relocation instruction as 
specified in the relocation header for the 

30 current section. 

rAddr Pointer to the next information item in 
the raw data for the current section. 
Initialized to the memory address of the 
35 beginning of the memory region for the 

current section number. 

rSyml A symbol index (count value) which is used 
to access a corresponding import's address 
40 in the imports [i] table. Initialized to 

the value 0 for each new section. 
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codeA Pointer to the starting memory address of 
. one of the loadable sections in the 
current PEF container. Initialized to 
point to the starting memory address of 
5 section number 0 if section number 0 is 

present and loadable, otherwise it is 
initialized to the value 0. As will be 
seen, certain relocation instructions can 
change the value of codeA, thereby 
10 affecting subsequent relocations by 

relocation instructions that refer to this 
variable . 

Pointer to the memory address of another 
one of the loadable sections in the 
current PEF container. Initialized to 
point to the starting memory address of 
section niimber 1 if section number 1 is 
present and loadable, otherwise 
initialized to the value 0. As with code 
A, certain relocation instructions can 
change the value of dataA, thereby 
affecting subsequent relocations by 
instructions that refer to this variable. 

A counter variable used in the repeat 
instructions, described below. 
Initialized to the value 0. 

30 The abstract machine variables codeA and dataA do 

not attempt to define which section is a code section 
and which section is a data section in the container, 
except to supply initial default values for these 
variables. The variables can point to any of the 

35 container's loadable sections, and can be changed as 
often as desired through the use of the relocation 
instructions . 

Note that although codeA and dataA are described 
above as pointing to the memory address of a loaded 
40 section, they do so by storing a value equal to the 
memory address of the loaded section minus the default 



dataA 

15 



20 



25 

rpt 



address for that section. The default address for a 



wo 94/22078 PCT/US94/02952 



- 37 - 

section is obtained from the desired Section Address 
field in that section's Section Header. The default 
address for the various sections are typically 0, 
however, so the value contained by the codeA and dataA 
5 variables will typically equal the starting address of 
the desired section in memory. 

After the abstract machine variables are 
initialized, a decision is made as to whether there are 
any (further) relocation instructions corresponding to 

10 information items within the current section (step 506) . 
This decision is made by testing whether reloc is less 
than rlEnd. If not, then the relocations are conplete 
for the current section and PERFORM RELOCATIONS routine 
210 is exited (step 508) . 

15 If more relocation instructions exist for the 

present section, then the next relocation instruction is 
retrieved and copied into a variable r (step 510) . 
(Note that in another embodiment, the relocation 
instruction may be retrieved, for example as part of a 

20 step which tests its contents, without actually copying 
it into a variable for subsequent reference.) The 
variable reloc is then incremented by two bytes in a 
step 512. As will be seen, most of the relocation 
instructions are two bytes in length, although a few of 

25 them are four bytes in length. In step 510, only the 
first two bytes are copied into r. After step 510, 
reloc points to the next relocation instruction if the 
current one is only 2 bytes in length, or to the second 
half-word of the current relocation instruction if it is 

30 4 bytes in length. 

All of the instruction formats begin with an opcode 
in the high- order bits. Referring to Fig. 5, the opcode 
is used in a "switch" step 514 to pass control to one of 
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a plurality of routines depending upon the contents of 
the opcode portion of the relocation instruction now in 
the variable r. In the present embodiment, all 
information items to be relocated are assumed to contain 
5 one or more relocatable references, each in the form of 
a 4 -byte virtual address field, and relocation is 
accoitplished by adding the contents of some specified 
pointer (a pointer to the start of a region in memory or 
a pointer taken from an import) to the relocatable 

10 reference. 

After completion of the particular routine called 
for by the opcode in the retrieved relocation 
instruction, control returns to the decision step 506 in 
order to determine whether any further relocation 

15 instructions have been provided for the present section. 
If so, then the next relocation instruction is retrieved 
and the loop continues. If not, then the PERFORM 
RELOCATIONS routine 210 is exited. 

20 RELOCATION INSTRUCTIONS 

Relocation instructions in the present embodiment 
are objects that are packed as opcode and arguments. 
Some of the relocation instructions act as directives 
for the abstract machine itself, while others instruct 

25 that a relocation needs to be performed, and others 
instruct that many relocations of a particular type are 
to be performed. The various field definitions for the 
different types of instructions are fully set forth in 
the following C language typedef. 
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typedef union { 



10 



15 



20 



25 



30 



35 



struct 
struct 
struct 
struct 
struct 
struct 
struct 
struct 



Reloclnstr instr; 
T0nsignedl6 bot; 
} Relocation; 



unsigned op: 7, rest:9; 

unsigned op: 2, delta_d4:8, cnt:6; 

unsigned op: 7, cnt_ml:9; 

unsigned op: 7, idx:9; 

unsigned op:4, delta_ml:12; 

unsigned op:4, icnt_inl:4, rcnt_ml:8; 

unsigned op: 6, idx_top:10; 

unsigned op: 6, cnt_ml:4, idx_top:6; 



opcode ; 

deltadata; 

r\in; 

gip; 
delta; 
rpt; 
largel ; 
large2 ; 



' Before proceeding, it will be useful to understand 
the structure of a data section 600 generated by a 
typical compiler. Such a structure is illustrated in 
Fig. 6. Grouped at the start of the data section 600 is 
initialized user data in a portion 602. Following the 
initialized user data portion 602, compilers typically 
place a group of contiguous transition vectors (also 
known as procedure descriptors) in a region 604. 
Transition vectors are used by external callers to 
procedures which are referenced by the transition 
vectors. Thus transition vectors are common especially 
in executable object code files produced on compilation 
of libraries. The compiler typically creates a 
transition vector for each procedure compiled, although 
by the time relocations are being performed by a loader 
program, the transition vectors for procedures which are 
never called by external callers typically have been 
eliminated. 

Transition vectors are usually created in one of two 
formats, indicated in Fig. 6 as formats 606 and 608, 
respectively. A given coii:5)iler typically produces 
transition vectors in only one of the two formats 606 or 
608. In format 606, each transition vector contains 
three 4 -byte words. The first word is a pointer 610 to 
the start of a procedure in the code section, and the 
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second word 612 is a pointer to a known location in the 
TOC for the* data section 600. The third word 614 
contains an "environment word" which is rarely used cuid 
is not important for an understanding of the present 
5 invention. As created by the compiler, the pointer 610 
contains an offset into the code section for the desired 
procedure and pointer 612 contains an offset into the 
data section 600 usually for the start of the TOC. Thus 
the first word of each transition vector 606 must be 

10 relocated by the loader program through the addition of 
the start address of the code section in the current 
container. This is referred to as a "code -type 
relocation". Similarly, the second word of each 
transition vector 606 must be relocated by the loader 

15 program through the addition of the start address of a 
data section, typically the same data section 600 which 
contains the transition vector 606. This is known as a 
"data-type relocation". The environment word 614 in 
each of the transition vectors 606 does not need to be 

20 relocated. 

Transition vectors of the second type 608 are the 
same as those of the first type 606, except that the 
third word of each transition vector is omitted. 

Following the transition vectors 604, compilers 

25 typically place a table of pointers, also known as a 
table of contents (TOC) in a portion 616 of the data 
section 600. The TOC contains various pointers 618 into 
the code section, pointers 620 into the data section 
600, and pointers 622 to external symbols which are to 

30 be imported. These pointers are typically intermixed, 
although it is frequent that several of one type are 
grouped together. As produced by a compiler, pointers 
into the code section carry only the offset into the 
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code section and pointers into the data section carry 
only the offset into the data section. These types of 
pointers 618 and 620 therefore require code and data- 
type relocations, respectively. Pointers 622 to 
5 external symbols typically contain 0 upon completion of 
compilation, although they are permitted to contain an 
offset to which the address of the desired symbol will 
be added. The latter address is taken from an 
appropriate entry in the imports [i] table created by the 
10 RESOLVE IMPORTS routine 208 (Fig, 4) . This type of 
relocation is referred to herein as a "symbol -type 
relocation" . 

Considering first the code-type relocations, it can 
be seen that the relocation table entries to accon^lish 

15 such relocations in conventional executable object code 
formats require 12 bytes of relocation information for 
each 4 -byte information item to be relocated. This is 
due in part to the fact that conventional format 
requires all of the following information to appear in 

20 each 12-byte relocation table entry: the section number 
containing the information item to be relocated, the 
offset within that section of the information item to be 
relocated, the section number whose starting address 
must be added to the information item, and an indication 

25 that the information item is a 32 -bit word. In the 
present embodiment, on the other hand, most of this 
information either is maintained in one of the abstract 
machine variables, or is inherent in the relocation 
instruction opcode. In particular, the offset in the 

30 current section of the information item to be relocated, 
is maintained in the abstract machine variable rAddr 
together with the section number containing the 
information item to be relocated. Similarly, the number 
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Of different relocatable address fields in the 
information item to be relocated, and the fact that each 
is to be treated as a 32 -bit pointer are inherent in the 
relocation instruction opcode, since a separate routine 
5 can be provided to perform the relocations specified by 
each relocation instruction opcode. Additionally, the 
section number whose starting address in memory is to be 
added to the offset value in the relocatable address 
field is also specified inherently by many of the 

10 relocation instruction opcodes. 

Moreover, whereas the conventional executable object 
code file format requires one relocation table entry for 
each address field to be relocated, the loader program 
of the present embodiment takes advantage of the fact 

15 that as mentioned above, several relocations of the same 
type are often grouped together (as in. the TOC) . That 
is, a coitpiler may generate five pointers 618 into the 
code section sequentially, followed by four pointers 620 
into the data section, followed by seven pointers 622 to 

20 external symbols, followed by more pointers 620 into the 
data section, and so on. The loader program of the 
present embodiment takes advantage of this 
characteristic by providing a class of relocation 
instructions which specify a type of relocation (e.g. 

25 code, data, symbol) together with a count of the number 
of such relocations to perform sequentially (i.e. in a 
"run") . 

Many of the relocation instructions which 
accommodate a run of similar type relocations follow the 
30 "run" instruction format illustrated in Fig. 7A. in 
this format, the high- order three bits carry an opcode 
of 010, the next four bits carry a subopcode indicating 
the type of relocation to perform, and the low- order 
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nine bits carry a count CNT9 indicating the number of 
consecutive ^ information items to be relocated in the 
manner specified by the subopcode. 

One such relocation instruction type is CODE which, 
5 in a step 516 (Fig. 5) adds the value in abstract 
machine variable codeA to the information item specified 
by the machine variable rAddr for CNT9 consecutive 
information items. rAddr is incremented by four after 
each individual one of the relocations so as to point to 

10 the next information item in the run. Another such 
relocation instruction type is DATA which, in a step 518 
adds the value in abstract machine variable dataA to the 
information item specified by the abstract machine 
variable rAddr for CNT9 consecutive four- byte 

15 information items. 

Another relocation instruction type, SYMR, performs 
a run of symbol -type relocations. For each of the 
symbol -type relocations in the run, the information item 
specified by rAddr is relocated by adding the value from 

20 imports [rSyml] , both rSyml and rAddr being incremented 
by 1 and 4, respectively, after each such relocation 
operation in the run. As mentioned, rSyml is 
initialized to zero at the start of processing a 
section. Unless specifically reset by an SYMB or LSYM 

25 instruction (described below) , rSyml increases 
monotonically as the section is processed. 

It can be seen that the availability of these three 
relocation- type instructions (CODE, DATA and SYMR) 
themselves can drastically reduce the number of bytes of 

30 Relocation Table information which are required to 
perform the required relocations on information items 
contained within a TOC. The DESC and DSC2 relocation 
instruction types can provide an even more drastic 
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reduction. As previously mentioned, in a typical 
executable object code file, all of the transition 
vectors are grouped together in a portion 604 (Fig. 6) . 
For transition vectors of type 606, each transition 
5 vector requires a code- type relocation, followed by a 
data- type relocation, followed by no relocation. For 
transition vectors of type 608, each transition vector 
requires a code- type relocation followed by a data- type 
relocation. In either case, the conventional executable 

10 object code format required 24 bytes (two 12 -byte 
relocation table entries) to specify a transition vector 
relocation. In the present eiribodiment , on the other 
hand, a run- type relocation instruction, referred to 
herein as DESC, is provided for relocating a run of 

15 transition vectors of type 606. The routine is 
implemented in DESC step 522 (Fig. 5) in which, for each 
three -word information item 606 in the run, the value in 
abstract machine variable codeA is added to the word 
pointed to by rAddr and rAddr is post- incremented by 

20 four. The value in abstract machine variable dataA is 
then added to the word pointed to by rAddr, and rAddr is 
post -incremented again by four. rAddr is then 
incremented by four a third time to skip over the 
environment word 614. This procedure is repeated for 

25 each information item in the specified run. 

The DSC2 relocation instruction type is similar to 
the DESC relocation instruction type, except that it is 
adapted for use on transition vectors of type 608. DSC2 
instructions are processed in step 524 by a routine 

30 which is identical to the DESC routine 522 except for 
the omission of the third incrementing of rAddr. 

Accordingly, it can be seen the relocation of an 
entire region 604 of transition vectors can be performed 
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using a .loader program of the present embodiment, in 
response to> a single 2 -byte relocation instruction. 
This represents a drastic savings over conventional 
loader programs. 
5 Within the above framework, numerous types of 

relocation instruction can be developed to simplify 
relocations on other commonly occurring relocatable 
information item formats as they are recognized. For 
example, one relocatable information item format which 
10 is frequently produced in the initialized user data area 
602 by C++ compilers is in the form of a one- word 
pointer to an object in the data section, followed by 
one word of nonrelocatable data. Thus in the present 
embodiment, a VTBL relocation instruction type is 
15 provided and processed in a step 526 which adds the 
value in the abstract machine variable dataA to the 
first word of each information item in a run, and leaves 
the second word of each information item in the run 
unchanged. Many other information item formats can be 
20 accommodated by other relocation instruction types. 

Certain relocation instruction types modify the 
abstract machine variables without performing any actual 
relocation operation. For example, a DELTA- type 
relocation instruction merely adds a specified value to 
25 rAddr, thereby effectively skipping over a niimber of 
information items which are not to be relocated. The 
format of the DELTA instruction is illustrated in Fig. 
7B, and in particular includes a 4 -bit high- order opcode 
of 1000, followed by a 12 -bit DELTA value B12 . DELTA 
30 instructions are processed in a step 528 (Fig. 5) , which 
merely adds B12 + l to the value in abstract machine 
variable rAddr. 
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-The GDIS instruction also modifies the value of an 
abstract machine variable without performing any 
relocations. In particular, it loads the codeA variable 
with the starting memory address of a specified one of 
5 the loadable sections in the present PEP container. The 
CDIS instruction follows the format of Fig. 7C and 
contains a high- order 3 -bit opcode of Oil, followed by 
a 4 -bit subopcode indicating the CDIS instruction, 
followed by a 9 -bit IDX9 field for specifying the 

10 section number whose start address is to be loaded into 
codeA. The CDIS instruction is processed in step 530 
(Fig, 5), which merely copies the pointer in 
regions [IDX9] into the abstract machine variable codeA. 
A similar instruction which follows the same opcode 

15 format as CDIS is the instruction DTIS, processed in a 
step 532 (Fig. 5) . The step 532 merely copies the value 
in regions [IDX9] into the abstract, machine variable 
dataA. The CDIS and DTIS instructions are most useful 
when multiple code and/or data sections are produced by 

20 the compilation system. 

Certain of the relocation instruction types 
supported in the present embodiment both perform a 
relocation operation and also further modify one of the 
abstract machine variables. For example, the DDAT 

25 instruction first increments rAddr by a specified number 
of words (similar to DELTA) , and then performs a 
specified number of contiguous data- type relocations 
(similar to DATA) . The DDAT instruction follows the 
format illustrated in Fig. 7D, which calls for a 2 -bit 

30 high-order opcode of 00, followed by an 8-bit W8 field 
indicating the number of words of information items to 
skip, followed by an N6 field indicating the number of 
data- type relocations to subsequently perform. DDAT 
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instructions are processed in a step 534 (Fig. 5) , in a 
routine which first increments rAddr by four times the 
value in W8, and then adds the value in dataA to each of 
the N6 following information items. rAddr is 
5 incremented after each such relocation operation such 
that when the instruction- is completed, rAddr points to 
the next information item after the run. 

SYMB is another instruction which modifies an 
abstract machine variable and then performs a relocation 

10 operation. The SYMB instruction follows the fomiat of 
Fig. 7C, and causes the abstract machine to first load 
rSyml with a new symbol number specified by IDX9, and 
then (similarly to the SYMR instruction for a run count 
of 1) add the value in imports [rSyml] to the relocatable 

15 information item then pointed to by rAddr. Both rSyml 
and rAddr are then incremented. The SYMB instruction is 
processed in a step 536 (Fig. 5) . 

The SECN instruction, which also follows the format 
of Fig. 7C, performs a single relocation to a section 

20 whose number is specified in IDX9. In step 538 
(Fig. 5), the routine for processing SECN instructions 
merely adds the value in regions [IDX9] to the 
information item pointed to by rAddr, and increments 
rAddr. 

25 The RPT instruction permits further compaction of 

the relocation table by providing a means of repeating 
a series of relocation instructions on a specified 
number of consecutive groups of information items. The 
RPT instruction follows the format of Fig. 7E, which 

30 includes a high- order 4 -bit opcode of 1001, followed by 
a 4 -bit number 14, followed by an 8 -bit CNT8 field. 
When invoked, the abstract machine repeats the most 
recent 14 relocation instructions, CNT8 times. The RPT 
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instruction is processed in a step 540 (Fig. 5) which 
first decrements the abstract machine variable rpt and 
tests for its value. If rpt is now less than 0, this is 
the first time that the RPT instruction was encountered 
5 and rpt is initialized to CNT8 + 1. Then, regardless of 
the value of rpt, the abstract machine variable reloc is 
decremented by the number of half-words specified by 14 
and the routine is exited. This has the effect of 
causing the abstract machine to back up by 14 half-words 

10 of relocation instructions, and to repeat them. Note 
that since rAddr is not decremented, the repeated 
relocation instructions will operate on the next group 
of unrelocated information items (rather than on a 
repeated ' group of information items). If the initial 

15 decrementing of rpt brought that eUds tract machine 
variable to 0, then the RPT instruction is determined to 
be complete and the routine 540 is exited. 

The number specified in 14 is a number of half-words 
rather than a number of relocation instructions. Thus 

20 if the repetition will include one or more 4 -byte 
relocation instructions, then the value in 14 must 
include an additional count for each of such 4 -byte 
relocation instructions. Also, the value in 14 is one 
less than the niomber of half-words to repeat (that is, 

25 14 s 0 specifies a one half-word repeat, 14 = 1 
specifies a two half-word repeat, etc.) 

The LABS instruction is a 4 -byte relocation 
instruction which follows the format of Fig. 7F. 
Specifically, the high- order 4 bits contain an opcode of 

30 1010; the next two bits contain a subopcode of 00; and 
the low-order 26 bits carry a value OPFSET26. The LABS 
instruction is processed in a step 542, which adds the 
value 0PFSET26 to the start address of the section whose 



wo W/22078 PCTAJSW/02952 



- 49 - 

information items are currently being relocated, and 
loads the result into abstract machine variable rAddr. 
Thus LABS performs a "jump" to a specific information 
item within the current section. Since the lABS 
5 instaruction is a 4-byte instruction, the routine 542 
concludes by incrementing reloc by another two bytes. 

The LSYM instruction also follows the format of 
Fig. 7F, with a subopcode of 01. It performs the 
functions of setting a new symbol number in rSyml, and 

10 subsequently performing a symbol -type relocation on the 
information item then pointed to by rAddr. Both rSyml 
cind rAddr are then post -incremented. The new symbol 
number is specified in the LSYM instruction in the field 
OFFSET26. The LSYM instruction is processed in step 

15 544. 

The LRPT instruction follows the format of Fig. 7G, 
in which the high -order four bits carry an opcode of 
1011, the next two bits carry a subopcode of 00, the 
next four bits carry a CNT4 value and the last 22 bits 

20 carry a COT22 value. LRPT is similar to RPT, but allows 
for larger repeat counts. Specifically, when the LRPT 
step 546. is invoked, the last CNT4 2 -byte relocation 
instructions are repeated the number of times specified 
in CNT22. Again, since LRPT is 4-byte instruction, step 

25 546 concludes, after the last repetition, by 
incrementing reloc by another two bytes. 

The instruction format of Fig. 7G is also used for 
three other 4-byte relocation instructions where the 
subopcode is 01 and CNT4 specifies which of the three 

30 instructions are to be invoked. The three instructions, 
all processed by LSECs step 548 (Fig. 5), are LSECN 
(which is the same as SECN but with a very large section 
number) , LCDIS (which is the same as GDIS, but with a 
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very large section number) , and LDTIS (which is the same 
as DTIS, but' with a very large section number). Again, 
since these instructions are 4 -byte instructions, LSECs 
step 548 concludes by incrementing reloc by another two 
5 bytes . 

If the opcode of the current relocation instruction 
pointed to by reloc is not recognized, then default step 
550 indicates an error and breaks. 

10 CONSTRUCTING A PEF FILE 

A PEF file in the format described eUbove can be 
created directly from compilers. Alternatively, a 
conventional compiler can be used to create a 
conventional format executable file, and a conversion 

15 tool used to convert the conventional format executable 
file into a PEF format file. As an illustration of 
procedures which may be used to create a PEF format 
file, Appendix B is C- language program which does so 
beginning from an XCOFF- format file. The program will 

20 now be described roughly by reference to flowchart Figs. 
9-15, it being understood that a more precise 
understanding of the procedures can be found in 
Appendix B. 

Fig. 9 illustrates the overall flow of the program. 
25 In an initialization step 902, the XCOFF file is opened 
and read into memory. The various XCOFF sections are 
located in memory and their addresses stored in 
variables . 

In a step 904, the import symbols, export symbols 
30 and import file Ids are extracted from the XCOFF file 
(in -memory copy) and converted to the organization used 
in the PEF file format. Fig. 10 is a flowchart 
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illustrating the symbol conversion step 904 in more 
detail . 

Referring to Fig. 10, all of the import file Ids are 
first extracted from the XCOFF file. Specifically, it 
5 will be recalled that the import file Ids are all set 
forth in the XCOFF file as variable length strings in 
the Iit^ort File Ids portion of the Loader Section of the 
XCOFF file. The ordinal of these strings represents an 
import file number, referred to in the XCOFF External 

10 Symbol Import/Export Table. The step 1002 thus involves 
copying each of the import file Id strings into the 
newly created PEF Loader String Table, and writing the 
offset of each string into a corresponding entry in the 
Import Container Ids porion of the PEF Loader Section. 

15 In a step 1004, the XCOFF imports are extracted from 

the XCOFF External Symbol Import/Export Table and Loader 
String Table. This is accomplished by traversing the 
XCOFF External Symbol Import/Export Table looking for 
import symbols, writing the import symbol name into the 

20 PEF Loader String Table and an accompanying offset to 
the string into the PEF Import Symbol Table. This 
traversal is performed separately for each import -file 
number, so that the resulting import symbols are grouped 
by import file number. 

25 In a step 1006, eaqport symbols are extracted from 

the XCOFF file. Specifically, the XCOFF External Symbol 
Import /Export Table and Loader String Table is traversed 
looking for export symbols, and for each such export 
symbol, the symbol name is written to the PEF Loader 

30 String Table and an accoit^anying offset is written to 
the PEF Export Symbol Table/ The routine also at this 
time computes the hash value for the export symbol and 
writes it into the corresponding entry of the Export 
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Chain Table. This traversal is performed separately and 
sequentially for different PEF section nximbers, thereby 
resulting in a list of export symbols grouped by PEF 
section number. 

5 In a step 1008, the convert symbols routine 904 

returns to the caller. 

Returning to Fig. 9, in a step 906, the program 
sorts the PEF export symbols as set forth in more detail 
in Fig. ii. As indicated in Fig. 11, in a step 1102, 
10 the routine first computes the size of the Export Slot 
Table given the number of exports contained in the XCOFF 
External Symbol Import/Export Table. In a step 1104, 
the procedure loops through all of the export symbols. 
For each export symbol, the procedure computes its hash 
15 slot value given the Export Slot Table size. The symbol 
is added to a linked list for that hash slot value. The 
Export Slot Table is created in the process, with the 
chain index field of each entry temporarily pointing to 
the linked list for the slot. 
20 In a step 1106, a determination is made whether more 

than a predefined number m of the slots contain more 
than a predefined number n export symbols. If so, then 
the Export Slot Table is too small. The size is doubled 
in a step 1108, and step 1104 is repeated. If the 
25 Export Slot Table is not too small, then the Export 
Chain Table and Export Symbol Table are created from the 
linked lists. Specifically, for each slot in the Export 
Slot Table, the linked list for that slot is walked to 
create these two tables. The chain index field for the 
iO slot in the Export Slot Table is overwritten with the 
correct index into the Export Chain Table. The 
procedure then returns to the caller (step 1112). 
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After the PEF export symbols are sorted (Fig. 9), 
the relocations in the XCOFF file are converted to PEF 
format (step 908) . Fig. 12 illustrates a procedure 
which accomplishes this task. 
5 Referring to Fig. 12, in a step 1202, a 

determination is made as to whether the code and data 
sections of the PEF file have their default numbering. 
If not, then a GDIS and/or DTIS instruction is "emitted" 
in order to set the CodeA and/or the DataA abstract 
10 machine variables. The "emitting" step is described 
below. 

In a step 1204, the routine begins looping through 
all of the XCOFF Relocation Table entries. In step 
1206, it is determined whether the current entry in the 

15 Relocation Table is an import symbol relocation. If so, 
then appropriate SYMR, SYMB or LSYM instruction (s) are 
emitted (step 1208) and control returns to the looping 
step 1204. In step 1210, the routine counts the number 
of successive entries in the Relocation Table which 

20 designate the same relocation type (code or data) for 
successive longwords in the data section, if this count 
is greater than 1, then DDAT, CODE or DATA instructions 
are emitted as necessary (step 1212) and control returns 
to the looping step 1204. 

25 If the run count from step 1210 is s 1, then in step 

1214, a new run count is made of successive pairs of 
entries in the Relocation Table which designate, for 
three successive longwords in the data section, a Code 
type relocation, a Data type relocation, then no 

30 relocation. If this run count is s 1, then DESC 
instruction (s) are emitted (step 1216) and control 
returns to the looping step 1204. 
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If the run count from step 1214 is less than l, then 
a new run count is made of successive pairs of entries 
in the Relocation Table which, for two successive 
longwords in the data section, designate a Code type 
relocation followed by a Data type relocation (step 
1218) . If this run count is a 1, then DSC2 PEP 
instruction (s) are emitted (step 1220) and control 
returns to the looping step 1204. 

If the run count of step 1218 is less than 1, then 
in a step 1222, another run count is made of successive 
Relocation Table Entries designating, for two successive 
longwords in the data section, a Data type relocation 
followed by no relocation. If this run count is a 1, 
then VTBL instruction (s) are emitted as appropriate 
15 (step 1224) and control returns to the looping step 
1204. 

If none of the above- described tests are successful, 
then the Relocation Table entry specifies a lone Data or 
Code type relocation. In a step 1226, the routine 
20 therefore emits a DDAT, CODE or DATA instruction as 
appropriate, and returns control to the looping step 
1204. 

When all of the Relocation Table entries have been 
traversed and converted to PEP relocation instructions, 

25 the routine 908 returns to the caller (step 1228) . 

Fig. 13 illustrates the routine 1208 to dp the SYMR, 
SYMB and LSYM instructions. In a step 1302, a run count 
is made of successive import symbol relocations in the 
Relocation Table for successive PEP import symbol 

30 numbers. Note that in order to use the PEP SYMR/SYMB/ 
LSYM relocation instructions, these import symbol 
relocations must be to successive import symbols as they 
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are n\ambered after the conversion to the PEF format 
(step 904), not before. 

In a step 1304, the delta between the first longword 
to be relocated for this run, and the last longword to 
5 be relocated for the previous PEF instruction, is 
determined. If this delta is greater than 0, then a 
DELTA instruction, or if necessary an LABS instruction, 
is emitted (step 1306) . Then, if the run count is 
greater than l (step 1308), one or more SYMR 

10 instructions are emitted (step 1310) and the routine 
returns to the caller (step 1312) . More than one SYMR 
instruction can be emitted if the run count is larger 
than the largest run count which can be represented in 
the run count field of the SYMR instruction. 

15 If the run count was equal to 1 (step .1308), then in 

order of preference, either an SYMR, SYMB or LSYM 
instruction is emitted (step 1314) and the routine 
returns to the caller (step 1312) . 

Fig. 14 sets forth a procedure to do the DDAT, CODE 

20 and DATA instructions 1212 (Fig. 12) . The procedure is 
reused in step 1226 (Fig. 12). Referring to Fig. 14, a 
determination is first made (step 1402) as to whether 
the run of entries is a run of Data type relocations. 
If so, then it is determined whether a DDAT instruction, 

25 possibly preceded by DELTA instruction, can successfully 
represent the run (step 1404) . The maximum run count 
which can be represented in a DDAT instruction is 
considered here, as are data alignment requirements. If 
such instruction (s) will work, then they are emitted 

30 (step 1406) and the procedure returns (step 1408) . 

If the DELTA/DDAT instructions will not accurately 
represent the run (step 1404) , or if the run is a run of 
Code type relocation (step 1402), then in a step 1410, 
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CODE or DATA instruction ( s ) are emitted as required. 
The step 1410 may be preceded by the emission of a DELTA 
or LABS instruction as required (step 1412) . The 
routine then returns to the caller (step 1408) . 
5 The step 1216 (Fig. 12) of doing DESC instructions 

is acconiplished with a procedure set forth in Fig. 15. 
Referring to Fig. 15, if a DELTA or LABS instruction is 
required prior to the DESC instruction, then it is 
emitted in a step 1502. The DESC instruction (s) are 

10 then emitted in step 1504, and the routine returns to 
the caller in step 1506. The routines in Fig. 12 to do 
DSC2 instructions (step 1220) and VTBL instructions 
(step 1224) are identical to that set forth in Fig. 15, 
except that in step 1504 (Fig. 15), DSC2 instructions or 

15 VTBL instructions are emitted instead of DESC 
instructions. 

In the above description of the routine to convert 
the relocations from XCOFF entries to PEF relocation 
instructions (Fig. 12), whenever it has been stated that 

20 a PEF relocation instruction is "emitted", the emitted 
relocation instructions could be placed in a temporary 
buffer. At a subsequent time, the temporary buffer 
could be scanned for repeating patterns and RPT or LRPT 
instructions generated therefor. 

25 Another technique for generating the RPT and LRPT 

instructions is to identify patterns as they are being 
created. In this techniques, PEF instructions which are 
"emitted" by the relocation conversion routine 908 are 
held in a first-in, first-out (FIFO) buffer until the 

30 buffer is full or a pattern is detected. If the buffer 
is full, then the first PEF instruction in the buffer is 
outputted, and the newly emitted instruction is added to 
the end; the routine then returns. If a newly emitted 
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instruction starts or continues a partial match of 
instructions in the buffer, then a match size variable 
merely increments. If the newly emitted instruction 
completes a match, then the match size variable returns 
5 to zero and a repeat count variable increments. If the 
newly emitted instruction breaks a match, then the 
common instructions of the pattern are first outputted 
and deleted from the buffer, and then an appropriate RPT 
or LRPT instruction is outputted. The newly emitted 

10 instruction is then added to the buffer. Note that 
instructions outputted by this pattern matching process 
are still not output to the final PEF file; they are 
merely stored sequentially in a final in-memory version 
of a PEF Relocations buffer. 

15 Returning to Fig. 9, after all of the entries in the 

XCOFF Relocation Table have been converted to PEF 
relocation instructions (step 908) , the data sections 
themselves are packed using the pattern- initialized data 
(pidata) format described above (step 910) . In step 

20 912, now that all of the PEF section sizes have been 
established, offsets are computed and all of the headers 
are filled with the necessary information pointing to 
the different sections (step 912). In step 914, each 
block of the PEF file is written out, with padding bytes 

25 inserted as necessary - to meet data alignment 
requirements . 

The invention has been described with respect to 
particular embodiments thereof, and it will be 
understood that ntimerous modifications are possible 
30 without departing from its scope. For example, the 
invention may be applied to relocatable object code file 
formats produced by a compiler prior to linking, 
although the invention is less useful in that context 
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due to the relative scarcity of repeated patterns of 
relocations to be performed. 
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APPENDIX A (37 C.F.R. 61 . 9fi (a) f 2 W± ^ 
Copyright: 1993 Apple Computer. Inr. 

OSErr PLPrepareRegion ( TPLPrivatelnfoPtr ourPUnfo, 
5 TCount rcglndcx, 

TAddress regAddress ) 

{ 

register TAddress '*'raddr; 

register TOffset dataA; 
10 register int cnt; 

register TOffset codeA; 

LoadcrRcIExpHeadcrPtr IdRelHdrPtr; 

Relocation ♦reloc, *rlend; 

Relocation r; 
15 long ipt; 

long seen; 

longrsymi; 

TOffset ^imports; 

TOffset ^regions; 
20 longi; 

longrelNum; • 

TOffset restart; 

int err - noErr; 
iKf NO GLOBALS 
25 #include "PEFtbl.h- 
je^endif 



if (ourPUnfo = = NULL) return parwnErr; 
30 if ((reglndcx < 0) | J (reglndex > = ourPUnfo- >numRegions)) return pararaErr: 

if (ourPUnfo- >scclions[regIndex].regionKind == IcLoaderSection) return paramErr; 
if (ourPUnfo- >sections[regIndex].regionKiDd -= kPlDataSection) return unimpErr; 

ourPLInfo-> resolved = 1; 

35 

for (i = 0; ; i + +) { 

if (i > = ourPUnfo - > numRegions) return noErr; Z+no relocations for this data section*/ 

IdRelHdrPtr = Sl ourPUnfo •> IdSections [i]; 

if (IdRelHdrPtr -> sectionNumber reglndex) break; 

40 } 

regions - ourPUnfo -> regionDcltas; 
imports = (TOffset *) ourPUnfo -> imports; 

45 regStart = (TOffset) regAddress; 

// subtract old default address from region address 

// region array will hold deltas from new address to old address 
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10 



// for (i = 0; i < ourPUnfo -> numRegions: 

// regions [i] -= ourPLInfo -> sections [ij. sectionAddress; 

// the regions array already holds deltas - no need to change it 

rebels (Relocation*) (ourPUnfo • > IdRelocations + IdRelHdrPtr *> relocationsOffset); 

ricnd = (Relocation ♦) ((Reloclnstr ♦) reloc + IdRclHdrPtr -> numRelocations); 

raddr = (TAddress *) regStart: 

rsymi = 0; 

codeA = regions [0]; 

dataA = regions [1]; 

rpl = 0; 



15 



20 



25 



30 



relNum = 0; 
while (reloc < riend) { 
r = *reloc; 

reloc = (Relocation *) ((Reloclnstr *) reloc + 1); 
switch ( opcode [r. opcode. op] ) { 
case krDDAT : raddr = (TAddress *) 

((TAddress) faddr + r.deltadata.delta_d4 * 4); 
cnt = r.deltadata.cnt; 
while (~cnt > = 0) { 
// printf C(0004:«06Ix)(#%4d) l\n", (char *) raddr - regStart, relNum++); 
*raddr+ + + = dataA; 

} 



continue; 



1; 



case krCODE : cnt = r.run.cnt ml + 
while (-cnt > = 0) { 
// printf (-(0004:%061x) (#%4d) OXn", (char *) raddr - regStart, relNum++); 
*raddr+ + + = codeA: 

} 

continue; 



case krDATA : cnt = r.nm.cnt_ml + 1; 

while (~cnt > = 0) { 

35 // printf (•(0004:%061x) (if%Ad) l\n", (char *) mddr - restart. reINum+ +); 

*niddr++ += dataA; 

} 

continue; 



40 case krDESC : cnl - r.nin.cnt_nil + 1; 

while (-cnt > = 0) { 
// printf ("(0004:%061x) (#%4d) OVn", (char *) raddr 

♦raddr+ + + = codeA; 
// printf ("(0004: %061x) (Jf.%4d) l\n', (char *) raddr 
45 ♦raddr++ += dataA; 

raddr++; 

) 

continue; 



regStart, relNum+ +); 
regStart, rclNum+ +); 
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case krDSC2 ; cnt = r.nin.cnt ml -k- I; 

while (--cnt > = 0) { 
// printf (-(00(M:%Q61x) (#%4<1) 0\n". (char ♦) raddr - regStart. relNum -»■+): 
*raddr+-l- += codeA; 
5 // prinlf (-(OCXW: %061x) (#%4d) I\n*, (char *) raddr - regSUrt. relNuni+ +); 

♦raddr++ += dataA; 

} 

contiDue; 

10 casekrVTBL: cnt - r,niii.ciit_ml + 1; 

while (-cnt > = 0) { 
// printf ("(0004: %061x) (#%4d) I\n", (char ♦) raddr - regStart, relNum++); 
*raddr+ + + = dataA; 
raddr+ + ; 

15 ) 

continue; 

casekrSYMR : cnt = r.mn.cnt ml + 1; 

while (-cnl > = 0) { 

20 // printf (-(0004: %061x) (#%4d) %d\n". (char *) laddr - regStart, relNuin+ + . rsymi); 

*raddr+-l- += imports [rsymi + +]; 

} 

continue; 

25 casekrSYMB: rsymi r.glp.idx; 

// printf ("(0004:%06lx) (#%4d) %d\n", (char *) raddr - regStart^ relNum++, rsymi); 
*raddr-K + -I- = imports [rsymi + +]; 
continue; 

codeA = regions [r.glp.idx}; 
continue; 

dataA = regions [r.glp.idx); 
continue; 

*raddr+ + -I- = regions [r.glp.idx]; 
continue; 

raddr = (TAddress *) 

((TAddress) raddr + r.delta.delta_ml 4- 1); 
continue; 

if (-rpl ==0) continue; // count was I -> rpt done 
if (rpt < 0) // first time rpt encountered? 

ipt = r.rpl.rcnt_ml .+ 1 ; // yes- initialize rpt count 
cnt = r.rpt.icnt^ral + 2; // yes or no - back up cnl instis 
reloc = (Relocation ♦) ((Reloclnstr *) reloc - cnt); 
continue; 



30 casekrCDIS: 
case krDTIS : 

35 

case krSECN 



case krDELT ; 

40 



casekrRFT : 

45 
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case krLABS : laddr = (TAddress ♦) 

((r.largel.idxjop < < !6) + rcloc -> boi + regStart); 
. reloc = (Relocation *) ((Reloclnstr *) reloc + I); 
continue: 

5 

case krLSYM : rsymi = (r.largel.idxjop < < 16) + reloc -> bot; 

reloc (Relocation *) ((Reloclnstr *) reloc +1); 
// printf ("(0004:«061x) (#%4d) %d\n". (char ♦) raddr - regStart. rclNum+ + , rsymi); 
*raddr++ += imports [rsymi ++}; 
10 continue: 



15 



20 



case krLRPT : if ( -rpt = = 0) { 

reloc = (Relocation *) ((Reloclnstr ♦) leloc + 1); 
continue; 

} 

if (rpt < 0) 

rpt = (r.laige2.idx_top < < 16) + reloc -> bot; 
cnl = r.lBrge2.cnt_ml + 2; 
reloc = (Relocation ♦) ((Reloclnstr *) reloc - cnt); 
continue; 



case krLSEC : 



25 



30 



35 



40 



) 



default : 
} 



seen = (r.large2.idxJop < < 16) + reloc -> bot; 
switch (r.laige2.cnt_ml) { 

case 0 : ♦raddr ++ + = regions [seen]; break; 

case 1 : codeA = regions [seen]; break; 

case 2 : dataA - regions [seen]; break; 

} 

reloc = (Relocation *) ((Reloclnstr *) reloc + 1); 
continue; 

err = unimpErr; 
break; 



// restore actual region addresses to the regions array 
// instead of region deltas 

// for (i = 0; i < ourPUnfo -> numRcgions; i++) 

// regions [i] + = ourPLInfo -> sections [i]. sectionAddress; 

// the region array now always holds deltas - don't want to change it back 



return err: 
) /* PLPrepareRegioD 0 */ 
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20 



25 



APPENDIX B (37 H .F.R. 51,96(aW2Wn 
Copyright 1992-X993 Annie Cc^mnnt^^r. Tnr, 



File: NakePEF.c 

contains: XCOFF to PEF converter tool 

Upitten by: Erik L Eidt 

Copyright: - 1992- 19« by Apple Computer, Inc., all ri*ts reserved. 



10 Ji^include <Errors,h> 

^include <Me(nory.h> 

#include <Ftle8.h> 

^include <string.h> /• jma */ 

15 #if MAC 

tfinclude <stdio.h> /* jma */ 
#end1f 



/* #1nelude <stdLib.h> /* jma V 

#if MAC 

^include <CursorCtl.h> 
#endH 



// for some constants Uke kDefVersion: 
#include <CFLoader.h> 



#include «XCOFF.h» 
30 #inclijde "PEF.h" 

typedef unsigned char byte; 

Mefine TRUE 1 
35 Mefine FALSE 0 

byte *MaUoc < long len ); 

long Write ( int fd, byte *buf, long len >; 

40 byte *PartialNan>e ( byte *n8me ); 

OSErr MakePStr ( byte *src, int len, byte *trg, int max )• 

void movebytes ( byte *s, byte *t, long length ); 

void elearbytes < byte •t, long length ); 

int coRiurebytes ( byte *s, byte *t, int length ); 
*ko int openRead ( byte *fNaine. byte **buf, lor« "len ); 

long Getint ( byte *ptr, byte **out, byte *end ); 

Int mchlen ( byte •str, byte •mch ); 

int LogZ ( unsigned long aln ); 

TBiolee^'^Mr?; ^ *"««^engthp, TVersion ninkedVer, TVersion -oldlBirt^er, 

int CheekXCOFF^C byte •theContainer ); 
int FindStuff < byte •theContainer ); 
int CodiputeOffsets ( void ); 
55 int FillHeaders < byte •fnFlleName ); 

long MapSecNunToPeff ( long secNm, long 'defAddr ); 

long RelocAddr ( TLdRelPtr reloc ); 
long Reloclndex ( TLdRelPtr reloc ); 
OO void dodelta ( long relpos, long curpos ); 

int RelCvt ( TLdRelPtr reloc, long count ); 
int DataAdJust ( TLdRelPtr reloc. long count ); 
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25 



30 



long strZlong ( byte **str ); 



Boolean TryPackingOata ( void }; 

OSErr PackOata ( register byte *ptr, register byte *end, byte *target, byte **tend ); 
5 OSErr UnpackOata ( register byte *ptr, register byte *end, register byte *trg, byte *tend 

); 

void PrtReloc ( Reloelnstr u1 ); 
void checicHateh ( Reloelnstr w1, int matchStart ); 
10 void RestartHatch ( Retoclnstr m1 ); 

void AddReloc ( Reloelnstr Ml. int matchstart }; 
long HacTime ( long unixTime ); 
long GetFileTime ( byte •filename ); 

15 /* the following were not declared, henee generated Harnings... Jnia •/ 

int ProcessExportList ( void ); 

void CoinputeSections ( void ); 

int SynCvt < TLdSymPtr IdSyms, long symCnt ); 

int SyroSort ( void ); 
20 int UriteFile < byte *outFi leName ); 

void Free < byte *tofree ); 

long FindXCOFFIndexByName ( byte *naine, TLdSymPtr IdSyms, long syirCnt ); 
/* ****************************************************** I 



/• A********************************************************* * ^ 

void convertString2Long (byte* theString, unsigned long* theLong); 
********************************************************** «^ 



#deffne badUnitErr -48 



// The default version value is now private to this tool. 
// This is a legitimate version, not the wildcard value. 
35 #define kDef Version 0 

long ALIGNMENT » 16; 
long oldDefVersion = kDef Version; 
long oldlcnpVersion s kDefVersion; 
40 long currentversion * kDefVersion; 
byte dataSection = kDatasection; 
byte shareOption s kContextShare; 

Mefine CODE INDEX 0 
45 ^define DATA INDEX 1 
tfdefine BSSIMDEX 2 
Mefine kSpecialSymbolCount 3 

int secNun (31; 
50 long locNum [3); 



// XCOFF stuff 

55 TFileHdrPtr fileHdrPtr; 

TAuxHdrPtr auxHdrPtr; 

TScnHdrPtr scnHdrs; 

TScnHdrPtr IdrScnHdr; 

TScnHdrPtr codeScnHdr; 

60 TScnHdrPtr dataScnHdr; 

TScnHdrPtr bssScnHdr; 

TLdHdrPtr IdHdrPtr; 

TLdSynoPtr IdSyms; 
65 long symCnt; 
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TLdRelPtr 
long 


IdRelocs; 
relCnt; 


5 


byte 


*LdStrin8?; 




long 
long 
byte 


codeVAddr; 
. codeLen; 
*codeStart; 


10 
15 


long 
long 
byte 
long 
byte 


dataVAddr; 

dataLen; 
*dataStart; 

packedDataLen; 
*packedData$tart; 


byte 
long 
long 


*bssStart; 
bssVAddr; 
bssLen; 


20 


long 


dbsLen; 




TLdSymPtr renSyn (3); 


25 


// PEFF stuff 



Reloclnstr ^relocations; 
long relUordCnt s O; 



byte *stringTable; 
30 Long stringTabSize = 0; 
long stHngTabOff; 

* Wefine kNunSpclSections 3 
#define kHaxLoadableSectfons 2 
35 #def1ne kNaxPeff Sect ions 3 
int NunPeff Sect Ions; 
int NunLoadableSections; 

int nextSectUun; 

40 int codeSectNim = -1; 

int dataSectNun « -1; 

int loaderSectNum e 

FileHeader fileKdr; 
45 SectionHeader sections [kMaxPeff Sect Ions}; 
byte strings [20]; 

LoaderHeader IdHdr; 

50 Loa<terRelExpHeader loaderHdrsAry [kNunSpclSections ♦ kMaxLoadableSectfons]; // 

LoaderRelExpHeaderPtr loaderHdr = ft loaderHdrsAry CkNuifipe I Sections]; 
LoaderlmportFilelOPtr infJortFiles; 
Loader IrrportPtr inportSynbols; 
long inportNext; 
55 HashSlotEntryPtr exportHashSlot; 

HashChainEntryPtr sortedHashChain. exportHashChain; 
LoaderExportPtr sortedExportSynbols, exportSynbols; 
long exportNext; 
long sortedExportNext; 

60 

long *XCOFFSyinIndexToPeffIirportNuriber; 
long lastFileAddr; 
65 Boolean gPackedData; 
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Boolean optKuUPad; 
Boolean optBSSExpand; 
Boolean optKeepHIT; 
Boolean optGlobalStringTab; 
5 Boolean optNoPackData; 

Boolean debug; 

byte •expFi leNatne; 
10 byte *outFileName = (byte •) "a.pef"; 

#define kDef Creator '???\?' 
#define kOefType 'APPL' 
unsigned long gFileCreator = 0; 
15 unsigned long gFtleType « 0; 
FInfo gFinderlnfo; 

Boolean entryllsed, initUsed, termUsed; 

20 byte *entryPoint; 

long entryPointHameLength; 

byte *injtPoint,- 

long InitPointNamelength; 

byte *tennPoint; 
25 long temPointNameLength; 



byte *libInfo [1001; 
30 int libOpts; 

long peffCodeVAddr; 
long peffDataVAddr; 

35 typedef struct < 

long format; 

byte *oldnaii«; 

byte *nanie; 

long secn; 
40 long offset; 

Int class; 
> AddExpType; 
#def1ne kAbsolute 0 
Meflne kEquivalence 1 

45 

AddExpType *addExpAry; 
long addExpCnt; 

typedef struct Weak Import ( 
50 struct WeakHflport •next; 

byte *lTbName; 
byte *naine; 
} Ueaklnport; 

55 UeaklB^rt *WeakIrnportL{st; 

int AddUeaklmport ( byte *libNaine, byte *name ); 
int CheckUeak ( void }; 

void Check InportsForUndef OK ( int filelndex }; 
60 int filecnt; 

byte *gTooiN8ine; 

«1fdef JNA /• for simple AIK testing 0*H*L*Y */ 

65 #include «h8sh.c» 
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#1nclude ••myMacLib.c'* 
«end{f 

main (fnt argc, byte "argv) 
5 < 

long' I en; 
byte •buf; 

int err; 

long sectionlndex; 
10 byte *libMaffle; 

byte *ptr; 
byte *inFileName; 
{nt f; 

15 gToolName « PartlalName ( argv [0] }; 

for <i B 1; i < argc; i**) < 

If (argv [i] CO] |b '-') < fileCnt**; InFileName » argv [f]; continue; > 
switch (argv [il [1]) C 
20 case 'a* : if (i+1 >= argc) break; 

ALIGNMENT = Getint ( argv C**ll, 0, 0 ); 
continue; 
case 'b' : optSSSExpand = TRUE; 
continue; 

25 case 'c' : if (i*1 >= argc) break; 

currentVersion = Getint ( argv [♦*!), 0, 0 >; 
continue; 
case 'd* : if (f4>1 >- argc) break; 

olcftefVersion « Getint ( argv [♦♦il, 0, 0 ); 
30 continue; 

case 'e* : if (i*l >= argc) break; 

entryPoint « argv [♦♦i]; 

entryPointNameLength a strlen ( (char*) entryPoint ); 

continue; 

35 case 'f : if (i*l >= argc) break; 

if ((argv [i] [2]) « 'c') /* file creator */ 

convertStringZLong ((byte*)argv [♦♦ij, ft gFileCreator); /• yuck! is there a 
better way to do this? */ 

else if ((argv CiJ [2]) 't') /• file type •/ 

40 convertString2Long ((byte*)argv [++ij, & gPileXype); /* yuck! is there a 

better way to do this? */ 
else 

expFileName = argv 

continue; 

45 case 'g' : optGlobalStringTab s TRUE; 

continue; 
case 'i' i if <i*1 >= argc) break; 
initPoint b argv 

initPointHameLength ■ strlen < (char*) initPoint ); 
50 continue; 

case M' : printf (•'# Xs: Bad switch -I (ijpper-caseM).\n« Old you nean 
lower-case-L insteadTXn", gToolNaroe); 
return 1; 
case 'k' : optKeepMIT = TRUE; 
55 continue; 
case 'L' : 

case 'I' : if (i*1 >= argc) break; 

llblnfo (libOpts^+l s argv [++i3; 
// format: -I I ibNamelsrename) [#v1(-v2)] I!) 
60 continue; 

case 'n' : optNullPad = TRUE; 

continue; 
case '0' : if (i+1 >= argc) break; 

outFileName s argv [♦+»; 
65 continue; 
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10 



15 



20 



25 



30 



case 'r' 



case 's' 



case 't' 



case 'u' 



case 'V' 



dataSection = kConstantSection; 
continue; 

if >s argc) break; 

shareOption » Getint ( argv [^-♦i), 0, 0 ); 

continue; 

if <i*1 >= argc) break; 
ternPoint « argv [♦♦i); 

termPointMameLength = strlen ( (char*) ternPoint >; 

continue; 

if (i+1 >3 argc) break; 

oldlmpversion e Getint ( argv t-^-^i), 0, 0 }; 

continue; 

if (i*i >= argc) break; 



sectionlndex » Getint ( argv [♦♦ij, & ptr, 0 ); 
if < *ptr** Is ) break; 
:h (sectionlndex) < 

0 : peffCodeVAddr s Getint t ptr, 0, 0 ); continue; 

1 : peffDataVAddr « Getint < ptr, 0, 0 >; continue; 



SMftch (sectionlndex) < 
case 0 
case 

> 

break; 

case 'w' : if (i*1 >b argc) break; 
UbName = argv [♦♦!); 
ptr - UbName; 

Mhile < *ptr &S *ptr }s ':' > ptr*4; 
if (*ptr != ':•) break; 
•ptr** « 0; 

AddUeak Import ( UbName, ptr ); 

continue; 
case 'x' : optMoPackData s TRUE; 

continue; 



help: printf ("# %s: usage: Xs [ options J [ -o outfile ] infile\n", gToolNaine, gToolName); 
printf <"# (numbers argunents are read in decimal iviless preceed by Ox ierOxlOO a 



name 
fname 
-fc name 
-ft type 



name 



35 hex 100>\n"); 

printf <"# optionstVn"); 
printf (•'# 
printf ("# 
printf (»# 
40 printf ("# 

printf ("# 
printf (»# 
printf ("# 
printf («# 
45 printf («# 

printf (••# -k 
<defsreinove)\n''); 

printf ("# -I 
printf («# 
50 printf ('•# 

printf ("# 
printf ('•# 
printf ('•# 
printf (»# 
55 printf (••# -n 

(def^onuU>\n">; 

printf ("# -o 
printf (»# -r 
printf ("# -s 
60 (def-contextShare)\n''); 

printf ('•# -t 
printf ("# -u 
printf ('•# -V 
printf («# -u 



align code & data secitons to # byte boundary (defB8)\n»); 
expand bss into zeroed data Cdefsno>\n"); 
set current -vers ton to # <defsO)\n"); 
set definition version to # <defsO)\n"); 
set entry point to routine name (defsi4)at-XC0FF-says)\n'*}; 
use file fname to load additional export UslNn"); 
set the creator of the output ffle\n"); 
set the file type of the output file\n"); 
set initialization routine to procedure name (defenone)\n*'); 
keep init.tem & nafn routines as export symbots 



oldhaiRe[enewname) (#versl-upfvll [!) M\n"); 

change imported library with name olckiane:\n"); 
enewname to newnameXn"); 

#vers to current -vers ion vers (# required)\n"); 
-upfv to up- from- vers ton upfv\n"); 
i to i nit before\n"); 

to ueak import\n"); 
pad exported loader syni>ols with trailing null 

fname set the target output file name (defsa.pef >\n«); 

mark data section as read-only (defBread-urite)\n"); 
# set share option of data section to it 



65 to be undef at runt{me>\n"); 



set termination routine to procedure name (def=none)\n"); 
set implementation version to # (def=0)\n«); 
make section m have default virtual address #2 (def=0)\n")* 
UbName: ImpNane make infwrt infMane fran library IfbNane ueak (ok ' 



name 
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printf -X disable automatic data section packing\n"); 

printf ("# Xs version 2.13 <b6)\n", gToolName); 
return 1; 

} 

5 

if (expFileNaiae U fileCnt »= 0) { > 
else if (fileCnt l» 1) goto help; 

«if MAC 

10 SpinCursor ( 32 ); 

«endif 

if (expFileName) { 

err - ProcessEx port List {); 
15 if (err) return 1; 

> 

«if MAC 

SpinCUTSor C 32 >; 
20 #end{f 

if (fileCnt) < 

err = OpenRead ( inFileName, & buf, & len ); 
if (err) return 1; 

err s checkXCOFF ( buf ); 
if (err) { printf ( «# Xs: 'Xs' not XCOFF executable file fonnat\n'» gToolNanie. argv 
[11 ); goto EBOT; > i a , a 

30 err c FindStuff ( buf ); 

if (err) goto EBOT; 

} ^ 

#if MAC 

35 SpinCursor (32 ); 

#endif 

CoinputeSeetiens (); 

40 «{f MAC 

SpinCursor ( 32 ); 
0end1f 

err s symCvt ( IdSyms, syinCnt ); 
45 if (err) goto EBOT; 

Ifif MAC 

SpinCursor ( 32 ); 
#endif 

err = SymSort (); 
if (err) goto EBOT; 



25 



50 



#if MAC 

55 SpinCursor ( 32 ); 

#endif 

if (fileCnt) < 

err B RelCvt ( IdRelocs, relCnt ); 
60 if (err) goto EBOT; 

> 

#if MAC 

SpinCursor ( 32 ); 
65 #endif 
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if ( dataSectNum >s 0 && ! optBSSExpand && ! optNoPackData ) 
gPackedData = TryPackingData (); 

#if MAC 
5 SpinCursor ( 32 ); 

«end1f 

ConputeOffsets (); 

10 FillHeaders ( {nFiieMane ); 

#if KAC 

SpinCursor < 32 ); 
#endH 

err s UriteFile ( outFileName ); 
if (err) goto EBOT; 



15 



#if MAC 

20 SpinCursor ( 32 ); 

«endif 

return 0; 

25 EBOT : 

printf <"# Xs: *Xs' not converced\n", gToolName, inFileMame); 
return 1; 
> /* main () V 

30 

^* *****•***«*«********«•**««**••••••• jin^ «•*««•**•««•*•«•«************•*«*******« • 

/* 

/* The Mac OS does not allow you to seek past EOF. 1 put in this routine to figure * 
/* out if we're trying to do so, and write an appropriate ntmber of bytes of zero * 
35 /* instead. 

^* *•****•***•••«***••«••«•*•••••**••* jpiy^ **************************************** * 

void my SetFPos (short refNun, short mode, long offset) 
< 

40 #ifdef MAC 

H^ragna unusedCmode) 
byte zero « 0; 
long i; 

long curLength; 



45 



GetFPos (refNum, fteurLength); /* get current size of file V 



if (curLength < offset) 
{ 

50 for (i = curLength; i < offset; i+*> 

< 

Urite ( refNun, Azero, 1); 

> 

return; 

55 ) 
#else 

SetFPos (refNum, mode, offset); 
«endif 
} 

60 

Boolean TryPackingData ( void } 
{ 

byte •block; 
65 long length; 
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byte *target, *end; 
OSErr err; 

block s datastart; 
5 length = dataLen;* 

target = (byte •> NeuPtr ( Imgth ♦ 10 >,• 
if ( ! target ) { 

10 retuiH^FALSE-'^' '^^ allocate target buffer (Xd bytes>\n-. gToolNane, length ); 

end c target ♦ length; 

err = PackData ( block, block ♦ length, target, & end ); 
15 H ( err > { 

printf <"# Xs; Packed Data not smaller, not packlng...\n«. gToolName)* 
return FALSE; ' 

) 

20 byte *compBuf; 

register int i; 

// printf ( "Unpacked Size «= Xd\n", length ); 

// printf ( "Packed Size Xd\n", gPackedSize >; 

// printf ( "Actual Packed size = Xd\n", end - target >; 

ccmpBuf « (byte *) NewPtr ( length ); 
if ( ! conpBuf ) { 

30 lengthV^"** ^"^ allocate coofiarison buffer (Xd bytes>\n", gToolName, 

return FALSE; 

> 

,c * ""PBCkData ( target, end. ccwpBuf, cocipBuf ♦ length ); 

■33 if ( err ) { 

printf (»# Xs: Error unpacking data for verification; not packing dataVn". 
gioolName); 



40 



55 



return FALSE; 
> 



for ( i = 0; \ < length; i++ ) { 
if ( compBuf [i] re block tij > { 

printf ('•# Xs: Verify error at byte offset Xxj not packing data\n". gToolName !)• 
return FALSE; . a v , » ncwic, w, 

45 > 

> 

> 

packedDataStart s target, 
50 paekedDataLen = end - target; 

dataSeetfon b kPIOataSection; 



return TRUE; 
> /* TryPackingOata {) •/ 



void ConputeSections ( void ) 

60 nextSectNun « 0; 

lastFileAddr s sizeof ffleHdr; 

if ( codeLen ) < 
65 codeSectNum » nextSectNum*~»; 
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lastFileAddr *' sizeof (SectionHeader); 



dbsLen » dataLen ♦ bssLen; 
5 if ( dbsLen ) < 

dacaSeetNum « nextSectNunn+; 
lastFileAddr sizeof (SectionHeader); 

> 

10 loaderSectNun » nextSectNuiH-^; 

lastFlleAddr +b sizeof (SectionHeader); 



15 



40 



50 



55 



if ( optGlobalStringTab ) 

lastFileAddr ♦= sizeof strings; 

> /* CoaputeSections () V 



long Align ( long offset, long aln ) 
20 < 

int left = offset & (aln-1); 
if (! left) return offset; 
return offset ♦ aln - left; 
) /• Align {) */ 

int Coinputeoffsets ( void ) 
long i; 

30 long loaderSectionSfze; 

if (dataSectMun >= 0) { 

loaderHdr IdataSectHun] . nuRiRe locations = relUordCnt; 

loaderHdr CdataSectMunl . relocationsOffset « 0; 

35 ) 



for (i B kReExport Import; i < kHaxLoadableSections; }♦+) 

if (loaderHdr li). nunRelocatlons) // ;| loaderHdr ti) . nun£xports) 

IdHdr. fRfflSections-M-; 



IdHdr. relocationsOffset - 

sizeof (LoaderHeader) ♦ 

IdHdr. numlnportPiles * sizeof (LoaderlmportFi lelO) + 
IdHdr. nunlirportSyms * sizeof ( Loader luport) ♦ 
4^ IdHdr. nunSections • sizeof (LoaderRelExpHeader); 



IdHdr. stringsOffset « 

IdHdr. relocationsOffset * 
relUordCnt * sizeof (Reloclnstr); 

IdHdr. hashSlotTable ■ Align ( IdHdr. stringsOffset ♦ stringTabSize, 4 >; 

loader Sect lonSize « IdHdr. hashSlotTable * 

(1 « IdHdr. hashSlotTabSize) * sizeof (HashSlotEntry) ♦ 
IdHdr. nunCxportSyms * sizeof (HashChainEntry) ♦ 
IdHdr. n«i£xportSyins • SIZEOF LoaderExport; 



// HOTE : 

// The section minbers of the sections is set by the macro's 

60 loaderSectNum,codeSectNum&dataSectNiAi 

// which are used to nurber the sections and set their section header order. 

// Rearange as desired by changing macros loaderSectNun, codeSectNun & dataSectNun 

// Be sure to Iceep the the nuifcers (and therefore headers) of the loadable 

// sections to be before the ntfifaers (and therefore headers) of the 

bb // non- loadable sections. 
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// HOTE: the order of the 3 data of the sections betou determines 

// the file order of the sections 

// the cods that actually writes to the file peforms independently 

// from the order of sections in the file 



// Loader Section 

sections [ loader Sec t MumJ . containerOffset « Align ( lastFileAddr, 8 > ; 
sections t loader Sec t Numl . rawSize s loaderSectionSize; 
sections (toaderSectNun] . aligrment > Log2 ( A ); 
lastFtltAddr = sections [loaderSectMunO . containerOffset ♦ 
sections CloaderSectNun] . rauSize; 



if (codeSectNun 0) ( 
// Code Section 

sections [codeSectNiaa . containerOffset - Align < lastFileAddr. ALIGNMENT ); 
sections tcodeSeetNuiO. rauSize - codeLen; 
sections [codeSectNun}. initSize ^ codeLen; 
sections [codeSectNum] . exeeSize e codeLen; 
sections [codeSectNun]. alignment = LogZ ( ALIGNMENT >; 
lastFileAddr = sections [codeSectNum]. containerOffset ♦ 
sections [codeSectNun]. rawSize; 



if (dataSeetNum >s 0) { 
// Data Section 

CNMENT)')"* • containerOffset s Align < lastFileAddr, (gPaekecOata ? 4 

sections IdateSectNun) . rawSize » gPackecCata ? packedDataLen ! 

(optBSSExpand ? dataLen + bssLen : dataLen); 
sections tdataSectNum] . initSize = (optBSSExpand ? dataLen ♦ bssLen : dataLen)- 
sections [dataSectNun] . execSize «= dataLen ♦ bssLen; ' 
sections [dataSeetNum]. alignment = L092 ( ALIGNMENT ); 
lastFileAddr « sections [dataSectNun] . containerOffset ♦ 
sections [dataSectMun] . rawSize; 



return 0; 
J /• CoinputeOffsets (> •/ 

int FillHeaders ( byte *inFi leName ) 

static byte sectionNames [] = "codeXOdataNOloader"; 

novebytes ( sectionNames. strings, sizeof 8eetionNaii«s ); 

« peffHagicI; 
= peffHag{c2; 
= peffTypelO; 
= PowerPC 10; 
e kPEFVersion; 



= fileHdrPtr -> fhTioOat 7 fileHdrPtr -> fhTinOat t 
e NacTime ( fileHdrPtr •> fhTinCat >; 



» oldDef Version;' // option 

« oldlrapVerslon; // option 



fileHdr. magici 
fileHdr. magicZ 
fileHdr. fileTypelD 
fileHdr. archttectureio 
fileHdr. versionNinber 

«1f HAC 

fileHdr. dateTfneStanf) 
CetFileTime ( inFl leName >; 
#else 

fileHdr. dateTlmeStamp 
#endif 

fileHdr. oleCefverslon 
fileHdr. oldlnpVersion 
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flleHdr. currentVersion = currentVersion; // option 

fileHdr. mmberSections = nextSectKum; 

fileHdr. loadableSections « nextSectNun - 1; // loader section not •'loadable'* 

5 sections (loaderSectMun) . sectionName = optGlcAalStringTab ? 10 : -1; 

sections [ loader Sec tN una . sectionAddress = 0; 

sections [loaderSectKun} . regionKind ^ kLoaderSection; 

sections (loaderSectNuiO. shareKind - kGlobalShare; 

10 If (codeSectNun >« 0) i 

sections (codeSectNuiO . sectlonName = optGlobalStrfngT^ ? 0 : -1; 
sections [codeSectNum] . sectionAddress = pef fCodeVAddr; // option 

sections [codeSectNum}. regionKind = kCodeSection; 
sections [codeSectNun]. shareKind = kGlobalShare; 

15 > 

if (dataSectNun >= 0) C 

sections [dataSectNun}. sectionName ~ optGlobalStringTab ? 5 : -1; 

sections CdataSectNun} . sectionAddress *> peffOataVAddr; // *- option 

20 sections [dataSectNun] . regionKind - dataSectlon; // -- option 

sections [dataSectNun]. shareKind » shareOptlon; // option 

loaderKdr [kReExport Import] . sectionNumber = kReExportloport; 
25 loaderHdr [kPhysicalExport] . sectionNunber s kPhysica I Export; 

loaderHdr CkAbsoluteExport] . sectionNunber = kAbsotuteExport; 



30 



If (codeSectNum 0} 

loaderHdr [codeSectNun). sectionNuiber = codeSectNun; 

if (dataSectNun >= 0} 

loaderHdr [dataSeetlitunl . sectlonNuiAier = dataSectNun; 



return 0; 
35 > /* FillHeaders <) */ 

int UriteFile ( byte *outFi leName ) 

40 short refMum; 

long i; 
int err; 
Str255 pStr; 

45 err = MakePStr < outFileName, strlen ( (char*) outFileName ), pStr. sizeof pStr ); 

If (err) return err; 

err ■ KCreate ( 0, 0, pStr, kDef Creator, kDefType ); 
if (err < 0) C 
50 #ifdef HAC 

\i (err 1= dupFNErr) { /• duplicates are okl 7 •/ 
#endif 

printf (»# %s: Error creating Xs Xd\n», gToolNane. outFileName, err); 
return -1; 
55 #1fdef MAC 
> 

tfendlf 
> 

60 err = HC^ierDf ( 0, 0, pStr, fsRdUrPerm, 8 refNum ); 

if (err < 0) < 

printf (»# Xs: Error opening Xs Xd\n", gToolName, outFileName, err); 
•return -1; 

65 ' 
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// File Header 

Mrite ( reflliin, (byte •> ft fileHdr, sizeof fileHdr >; 
// Section Headers 

for (f s 0; i < fileHdr. nunberSections; i**) 

^ urite < refHim. (byte •) & sections [i]. sizeof (SectionKeader) ); 



// Global String Table 
if ( optGtobalStringTab } 
< 

^ Write < refNua, strings, sizeof strings >; 



// Loader Section 

my.SetFPos < refNun, fsFromStart. sections [loaderSectNiW] . conteinerOf fset ); 
Urite < refuum, (byte •) ft idHdr, sizeof IdHdr ); 



// Loader RetExp Headers 
Write ( refNun, (byte •) importFi les, IdHdr. numlmportFiles * sizeof 
(LoaderloiportFilelO) >; 

25 ^ «*efMi«i, (byte *) importSyirbols. IdHdr. nuRlii|K>rtSyms • sizeof (Loaderli^Mrt} ); 

for (i = kReExport Import; i < kHaxLoadableSections; i*+) < 

if ( loaderHdr [i]. nunRe locations ) // j j loaderHdr tij. nui£xports ) 

^ Wite ( refNum, (byte •> ft loaderHdr [i], sizeof (LoaderRelExpHeader) ); 

> 

Write ( refNun, (byte *> relocations, relWordCnt * sizeof (Reloclnstr) ); 
Urite ( refNuro, (byte •) stringTable, stringTabSize ); 
Jb my.SetFPos ( refNiM, fsFromStart, sections (loaderSectNum). contalnerOffset ♦ 

IdHdr. hashSlotTable ); 
// to skip any padding after string table 

Urite ( refMun, (byte *) exportHashSlot, (1 « IdHdr. hashSlotTab$ize) • sizeof 
40 (HashSlotEntry) ); 

^ Urite ( refMun, (byte *) sortedHashChain, IdHdr, numExportSyns • sizeof (KashChatnEntry) 

Urite ( refNun, (byte •) sortedExportSymbols, IdHdr. numExportSyns * SIZEOF LoaderExport 

45 

// Code Section 

ff (codeSectNun >e 0) { 

my.SetFPos ( refNun, fsFromStart, sections [codeSectNun]. containerOffset ); 
50 ^ Urite ( refNun, codeStart, codeLen ); 

// Data Section 
55 if (dataSectNun >s 0> { 

iny^SetFPos ( refMun, fsFromStart, sections CdataSectHun] . containerOffset ): 
if ( gPackedData } ' 

Urite ( refMun, packe<DataStart, packedDataLen ); 
else 

Urite ( refNun, dataStart, dataLen >; . 
if (optBSSExpand) 
{ 

Urite ( refNun, bssStart, bssLen ); 

65 > ^ 
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// End of file padding 

if ( lastFileAddr Align ( lastFileAddr, ALIGNMENT ) } { 

my^SetFPos ( r«fNun, fsFromStart, Align ( lastFileAddr, ALIGNHENT ) • 1 }; 
5 Write < refNum, (byte •) 1 ); 

) 

«if MAC 

SetEOF ( refNun. Align ( lastFileAddr, ALIGNHENT ) ); 
10 #endif 

FSClose ( refNun ); 
15 /• 

if creator/filetype were specified, set 'em now... 

*/ 

if (gFileCreator |; gFileType) 
i 

20 #if MAC 

Str25S pStr; 



25 



35 



50 



err = MakePStr ( outFileName, strlen ( (char*) outFileName ), pStr, sizeof p6tr }; 
if (err) return err; 



err e HGetFInfo ( 0, 0, pStr, & gFinderlnfo >; 
if ( err ) { 

printf (••# Xs: cannot get file type and creator info on Xs (Xd)\nM, gToolNa 
outFileName, err); 
30 return 1; 

> 



if ( gFileCreator } 

gFinderlnfo. fdCreator = gFileCreator; 

if ( gFileType > 

gFinderlnfo. fdType s gFileType; 



err = HSetFInfo ( 0, 0, pStr, & gFinderlnfo ); 
40 if ( err ) ( 

printf (••# %s: cannot set file type and creator info on Xs (Xd)\n", gToolName, 
outFileName. err); 
return 1; 

45 #else 

printf ("# Xs: cannot set file type and creator on this systen\n", 

gToolName); 
«end1f 
> 



return 0; 
} /* WriteFile () •/ 



55 byte •SkipCount ( byte •thisChar, int liblndex, int offset, fnt *length > 
int skfpcount; 

for (SkipCount = 0; skipCount < (3 ♦ liblndex*3 ♦ offset); skipCount**) { 
60 while (*thisChar !e O) thischar^^; /* Skip the current string */ 

thisCharM>; /• Get to the start of the next string •/ 

> 

*length o strlen ( (char*) thisChar ); 
return thisChar; 
65 ) /• SkipCount () V 
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10 



15 



20 



25 



40 



45 



byte *GetLtbNaine ( byte nhisChar, fnt liblndex. int *length ) 
< 

static byte name «[256]; 
byte *fUeH, •menfaH; 
int fLen, nLen; 

fileH = SkipCoiFit ( thisChar, liblndex, 1, & fLen ); 
membN = SkipCount ( thfsChar. Ublndex, 2, & inLen ); 

if (! mLen) < 
* length « fLen; 
return fUeN; 

> 

movebytes ( flleN, name, fLen ); 
name tfLenl = 

p»vebytes ( meinbN, name+fLem^l , mLen }; 
name CfLen+l+mLenJ e 0; 

•length = fLen ♦ 1 ♦ mLen; 
return name; 
> /* GetLibName (> V 



GetName ( TLdSyirPtr sym« byte **name, Int *n«neLength > 
( 

int I en; 



30 if ( syrn -> IsZeroes I- 0) < 

byte *ptr = (byte •) sym -> IsName; 
•name * ptr; 
ten « 0; 

uhile (ten < 8 ftft "ptr) < ptr**; len++; > 

J 3 ) 

else < 

♦name = IdStrings ♦ sym -> IsOffset; 
ten a strlen ( (char*) *name ); 

> 



•naneLength " len; 

return 0; 
> /• GetNane () */ 



SizeString ( int length, Boolean needNull ) 
( 

stringTabSize += length; 
50 if ( needNull \ \ optNuUPad ) 

StringTabSize**; 

> /• SizeString (> */ 

55 Addstring ( byte *name, int length. Boolean needNull ) 

< 

movebytes ( name, stringTable ♦ strfngTafaOff, length ); 
stringTabOff ♦= length; 
if ( needNull •} optNullPad ) 
60 StringTabOff ♦♦; 

> /• AddStrtng () •/ 



long HapSecNinToPeff ( long secNm. long 'defAddr ) 
65 c 
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if (secNm < 0) C 
*defAddr s 0; 
return secNm; 

> 

5 

if (secNm == secNum [CODE INDEX]) { 
*defAddr s codeVAddr; 
return codeSeetNum; 

> 

10 if (secNm » secNin [DATAIKOEX]} ( 

*defAddr = dataVAddr; 
return dataSectNum; 

> 

if (secNm secNum CBSSINDEX]} { 
15 // the next line shcxjld result in subtracting out the start of bss, 

// and adding in the length of data to the syirbol's address, 
// however, since XCOFF always places bssVAddr = dataVAddr + dataLen; 
// this line could be reduced to *defAddr s dataVAddr; 
*def Addr = bssVAddr - dataLen; 
20 return dataSectNim; 

> 

return -1; 
> /* HapSecNumToPeff () V 



int ClassOf ( TLdSyirPtr syn ) 
{ 

switch ( sym -> Is^lass } C 
case XHC PR : return kCodeSyntoo I ; 
30 case XMC"SV : /* fall thru */ 

case XMC~DS ; return kTVectSyniX) In- 
case XmOc : return kTOCSynbol; 
default : return kOataSyirbol; 
> 

35 > /• ClassOf () V 



Boolean IsValidlnport ( TLdSymPtr syn ) 

( 

40 /* 

don't reniove iofNsrt syirbols - 
if ( syn >■ refflSym CO] } return 0; 
if ( syn " remSym CI] ) return 0; 
if ( syn " remSym C2] ) return 0; 

45 V 

return IsLdlinport ( *sym ) && 
sym -> IsIFi le > 0 && 
syn •> IsIFile <= IdHdr. numlnportFi les; 
50 > /* isvalidimport (} */ 



Boolean IsReExportlnywrt ( TLdSynPtr sym ) 

55 // remove export syntols only 

if ( sym =e remSyn [0] ) return 0; 

if ( sym == remSyn [1] ) return 0; 

if ( sym es remSyn 12) ) return 0; 

60 return IsLdExport ( *sym ); 

> /* I sReExport Import <) */ 



Boolean IsValidExport ( TLdSymPtr sym ) 
65 < 
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// remove export syntwls only 
H ( $ym OS remSyn CO] ) return 0; 
If ( sym == refflSym [1] ) return 0; 
^ if ( sym » remSym tZJ ) return 0; 

return IsLdExport ( *syin ) && 

(sym •> IsScNim == secNum [CODEINOEX] 
sym •> IsScHixR secNun COATA INDEX] ;; 
sym -> IsScNun == secNum [BSSINDEX]); 
10 > /* IsVaUdExport (} V 

AddE Xpert ( long seen, long theAddr, byte •name, int nameLength, int symClass ) 
{ 

15 exportSymbols [exportNext] . symClass = symClass; 

export Symbols [exportNext] . nameOffset » stringTabOf f ; 
exportSymbols [exportNext]. sectionNurber e seen; 
// loaderHdr [seen). nutnExports**; 

exportHashChain [exportNext). hashuord c CFLHash ( name, naneLength ); 
20 exportSymbols [exportNext]. address = theAddr; 

exportNext-»^4; 
> /* AddExport <) •/ 

25 Addlmport { !nt XCOFFindex, TLdSymPtr sym, byte •name, int nameLength > 

int symClass = ClassOf ( sym >; 

XCOFFSymlndexToPefflmportNimber [XCOFFindex] c importNext; 
30 importSyirtJOls [importNext] . symClass = symClass; 

importSyinbols [importNext]. nameOffset s stringTabOff ; 

// if both import & export, add to both 

// tables, but only one string table entry 

if (IsReExportltnport < sym ) ) 
35 AddExport { IcReExport Import, importNext, name. nameLength, symClass ); 

importNext'*"*'; 
) /* Addlmport () •/ 

40 int symCvt ( TLdSymPtr IdSyms, long symCnt ) 
< 

TVersiort MnkedVer, otdlmpVer; 

TBoolean IB; 

long filelndex; 
45 long symlndex; 

byte *name; 

int nameLength; 

byte *ldFiles; 

long size; 
50 TLdSynPtr sym; 

long cnt; 

long seen; 

long defAddr; 

long sNum; 
55 TBoolean istmp; 

// set inital values to these 
if (fileCnt " 0 j| auxHdrPtr -> ahEntry =« -1) < 
60 IdHdr. entrypointsection - IcNoRegion; 

> 

else { 

IdHdr. entrypointsection - HapSecHumTePeff <auxHdrPtr -> ahSrCntry, ft defAddr); 
IdHdp. entryPointOffset = auxHdrPtr -> ahEntry - defAddr: 

65 > 
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IdHdr. initPointSection - kNoRegion; 
IdHdr. tern<>ointSection " kNoRegion; 

5 // determine sizeof string table 

// first • sfze of import lib names 

if (IdHdrPtr) { 

/* Skip the first file string, it is the UNIX path V 
10 IdHdr. numlnportFiles « IdHdrPtr -> IhNInpID - 1; 

IdFiles s (byte •> IdHdrPtr ♦ IdHdrPtr -> IhlnpOff; 

) 

for (filelndex - 0; filelndex < IdHdr. nunlrportFi les; f i leIrKlex++) <; 
15 naow = GetLibNaine < IdFiles, filelndex, & nameLength ); 

CheckLibOpt ( & name, & nameLength. & linkedVer, & oldlmpVer. & IB. >; 
SizeString ( nameLength, TRUE >; 

) 

20 

// next size of all symbol's names 
// and types of them • import vs. export 
for (symlndex = 0; symlndex < symCnt; syinlndex-*-^) { 
sym - & IdSyns [symlndex]; 
25 GetName ( sym, & name, & nameLength >,* 

// identify entryPoint, initPoint, termPoint, by name 
// as user specified options, and remove from general 
// processing of synixils. 

30 

isimp s IsValfdImport ( sym ); 

If (islmpu I sReExport Import ( sym ) [| isValidExport ( sym ) ) { 
if ( nameLength entryPointNameLength At 

comparebytes { name, entryPoint, nameLength ) > { 
35 if ( isImp } { 

IdHdr. entryPointSection = kReExport Import; 
IdHdr. entryPointOffset = sym -> IsValue; 

> 

else < 

40 IdHdr. entryPointSection = HapSecNunToPeff ( sym -> IsScNum, & defAddr >; 

IdHdr. entryPointOffset = sym -> IsValue - defAddr; 

> 

if <! optKeepNIT) remSym [0] = sym; 
entryused • TRUE; 
45 > 

if ( nameLength == initPoint NameLength && 

comparebytes ( name, initPoint, nameLength ) ) C 
if ( isimp ) < 

IdHdr. initPointSection *> kReExport import; 
50 IdHdr. initPointOffset s sym -> IsValue; 

> 

else < 

IdHdr. initPointSection ° KapSecNumToPeff ( sym -> IsScNun, ft defAddr >; 
IdHdr. initPointOffset = sym -> IsValue - defAddr; 

55 > 

if (• optKeepHIT) remSym [1] » sym; 
initUsed - TRUE; 

) 

if ( nameLength =s tenri>ointNameLength U 
60 comparebytes ( name, temPoint, nameLength ) ) { 

if ( isImp ) I 

IdHdr. termPointSection = kReExportlmport; 
IdHdr. termPointoffset = sym -> IsValue; 

65 else { 
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IdHdr. termPofntSection s MapSecNunToPeff ( sym -> UScNum, ft defAddr }; 
IdHdr. temPointOffset ^ sym -> IsValue • defAddr; 

> 

if <l optKeepMlT) refflSym (23 = sym; 
5 ternUsed « TRUE; 

> 

> 

10 if ( IsValidlmport ( sym > } { 

IdHdr. numlraportSyias+-*-; 
SizeString < nameLength, TRUE ); 
if (IsReExportlinport ( sym )) 
IdHdr. numExix)rtSyms++; 

15 > . 

else If ( IsValidExport ( sym ) ) < 
IdHdr. nuiCxportSyins+^; 
sizeString ( naneLength. FALSE ); 

20 > ^ 

if ( entrypoint &ft ! entryUsed ) 

printf (••# Xs: warning, the specified entry point 'Xs' was not found\n", gloolName, 
25 entrypoint); 

if ( initPoint SA ! initUsed ) 

printf ("# Xs: warning, the specified init routine 'Xs' was not found\n'i gToolName. 
initPoint); 

if ( ternPoint && ! tenrUsed ) 
30 printf («# Xs: warning, the specified term routine 'Xs* was not found\n<< gToolName, 

teniPoint); 

for (symlndex = 0; symlndex < addExpCnt; symlndex^-i-) { 
35 SizeString ( strlen ( (char*) addExpAry [symlndex]. nam ), FALSE >; 

tdHdr. nuRCxportSyms-M-; 

> 

40 // allocate syniwl string t^les and iiifx>rt / export tables 

stringTable s Malloc ( (size = stringTafaSize) ♦ 4 >; 
if (! StringTable) goto Mem Error; 
clearbytes ( stringTable, sTze ); 
stringTabOff s 0; 



45 



XCOFFSyrolndexToPeff iBiportNunber = (long *) Halloc ( size = sizeof (long) * synCnt ); 
if (! XCOFFSymlndcxToPeff ImportNunber) goto Mem Error; 
clearbytes ( (b^te *) XCOFFSymlndexToPeff itnportMunber, size ); 



50 importFiles = (LoaderlmportFi lelOPtr) 

Malloc ( size = sizeof (LoaderlmportFi I elD) * IdHdr. nunlnportFiles ); 
if (I importFiles) goto Mem Error; 
clearbytes ( (byte *) importFiles, size }; 

55 importSyniwls = (LoaderliiportPtr) 

Malloc ( size = sizeof (Loader Import) * IdHdr. nunlnportSyms ); 
if (I importSymbols) goto Mem_Error; 
clearbytes ( (byte •) importSynfcols, size ); 
inportNext = 0; 

60 

exportSymbols = (LoaderExportPtr) 

Malloc ( size - sizeof (LoaderExport) • IdHdr. nunExportSyms ); 
if (I exportSymbols) goto Mem_Error; 
clearbytes { (byte •) exportS^nbols, size ); 
65 exportNext = 0; 
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exportHashChain > (HashChainEntryPtr) 

Malloc (size - sizeof (HashChairfntry) * IdHdr. nuD£xportSyms ); 
if (! exportHashChain) goto Mein_£rror; 
clearbytcs < (byte *) exportHashChain, size ); 

// extract imported library symbols 

for (filetndex » 0; fUelndex < IdHdr. nunlnportFi les; f i lelndex^*) < 
nane ■ GetLibName ( IdFiles. filelndex, & namelength >; 
CheckLibOpt ( & name, & nameLength, & linkedVer, & oldlmpVer, & iB ); 
ifnportfiles (filelndexl. f i leNaroeOf f set » stringTabOff; 
importFiles [filelndexl. linkedVersion » linkedVer; 
importFiles [filelndexl. oldlmpVersion « oldlmpVer; 
importFiles [filelndex]. initBefore = iB; 
AddString ( name, naneLength, TRUE ); 



// extract inports for each library 

for (filelndex e O; fUelndex < IdHdr. nunlinportFi les; f ilelndex^-*^) C 
// add imported symbols, in order of the files they belong to. 
cnt = 0; 

importFiles [filelndexl. irapFirst * inportNext; 
for (symlndex « 0; symlndex < synCnt; symIndex-*-+) { 

syro » i IdSyrns [symlndex J; 

if (< IsValidlmport ( sym }} continue; 

if (sym -> lsIFile-1 != filelndex) continue; 

GetName ( sym, & name. & namelength ); 

Addlmport ( symlndex, sym« name, namelength }; 

AddString ( nane, namelength, TRUE ); 

cnt**; 

) 

importFiles [filelndexl. nunlmports = cnt; 



// extract exports by section mnber 

for (sNum « kReExport Import; sHum < kNaxloadableSections; sMun*+) ( 

if (sNun 1= kReExportlinport) // ue already started accunulating exports of type -3 

; // loaderHdr tsNum]. expFirst » exportNext; 
for (symlndex = 0; symlndex < symCnt; symlndex**) ( 
sym s & IdSyros [symlndex]; 

if < IsValidlmport ( sym ) ) continue; // if it was an iB?)ort it was checked 

already 

// to see if it is an export 

if (I IsValidExport ( sym ) ) continue; 

seen = NapSecNuroToPeff ( sym -> IsScMun, & defAddr ); 

if ( seen Is sNiia } continue; 

GetName ( sym, ft nane, & namelength }; 

AddExport < seen, sym -> IsValue - defAddr, name, namelength. ClassOf < syn ) >; 
AddString ( name, namelength, FAISE >; 

for (symlndex = 0; symlndex < addExpCnt; symlndex**) { 
long xclndex; 

AddExpType *add » & addExpAry [symlndex] ; 
switch ( add •> format ) { 
case kEqui valence : 

xclndex - FindXCOFFtndexByNane ( add •> oldbiane, IdSyms, symCnt ); 
if ( xclndex < 0 ) { 

printf ("# Xs: syiribol 'Xs' not found for export 'Xs'\n", gToolNaoe, add •> 
oldname, add -> name); 
return 1; 

> 

sym = & IdSyms [ xclndex j; 

namelength = strlen ( (char*) add -> name ); 
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if < IsValidlniport ( sym ) ) { 
long implndex; 

if ( kReExportlmport != sNum ) break; 

implncjex = XCOFFSymlndexToPef f ImportHuoitoer [ xclndex J; 
5 AddExport ( kReExportlmpprt, iiaplndex, add -> name, nameUngth, ClassOf ( 

sym > >; . 
> 

else { 

_ ^ seen « NapSecNunToPeff ( sym -> IsScNun, & defAddr ); 

10 if ( seen !« sNum } break; 

AddExport < seen, sym -> IsValue - defAddr, add -> nane. nameLength. ClassOf 

( sym ) ); 
> 

Addstring ( add name, nameLength, FALSE ); 

15 break; 

case kAbsolute : 

if ( add -> seen != sNum ) break; 
naneLength « strlen < (char*) add -> name ); 

AddExport ( add -> seen, add -> offset, add -> name, nameLength, add -> class 
Addstring ( add -> name, nameLength, FALSE ); 

break; 

> 

25 , ' 

Checkweak (); 
return 0: 

30 

Mem_Error: 

printf (»# Xs: Could not allocate Xd bytesXn", gToolName, size); 
return 1; 
35 > /• SymCvt <) •/ 



int SymSort ( void ) 

i 

40 long i; 

long s; 

long • exportL inks lot; 
long hashSlotSize; 
long size; 
45 long oldlndex; 

long hashSlot; 
long oldcc; 
int hsincr = 0; 

50 // compute size of hash slot table 

IdHdr. hashSlotTabSize = CFLHashSlotBits ( IdHdr. nuiCxportSyms ); 

hashSletSize a i « idHdr. hashSlotTabSize; 

55 export HashSlot = (HashSlotEntryPtr) 

Kalloc ( size = sizeof (HashSlotEntry) * hashSlotSize >; 
if (I exportHashSlot) goto Hen Error; 
clearbytes < (byte *) exportHashSlot. size >; 

60 exportL inks lot = (long *) 

Malloc ( size = sizeof (long) * IdHdr. numExportSyms ); 
if (! exportL inks lot) goto Hem Error; 
clearbytes ( (byte *> exportLinkSlot, size ); 

65 for (i 3 0; i < IdHdr. nuiCxportSyms; i**) ( 



84 



hashSlot - CFLHashSlot ( exportHashChain [i}. hashword, IdHdr. hashSlotTabSize ); 
exportLinkSlot [i] - exportHashSlot ChashSiot], chainlndex; 
oldcc - exportHashSlot [hashSlot] . chainCount; 
exportHashSlot. [hashSlot]. chainCount++; 
exportHashSlot [hashSlot]. chainlndex » i; 
if (exportHashSlot [hashSlot]. chainCount 1" oldcc 1) < 
if < ♦♦haincr > 3) { 

printf (•># Xs: hash chain count overflow (quitting)\n", gToolMame); 

return -1; 

> 

printf ('•# Xs: Hash chain count overflow, using more slotsNn", gToolNane); 
Icflidr. hashSlotTabSire^-*; 
Free ( (byte *) exportHashSlot ); 
Free ( (byte *) exportLinkSlot >; 
goto top; 
> 

if ( exportHashSlot ChashSiot]. chainlndex I" i) i 

printf ("# Xs: hash slot value overflow (too many exports, quitting)\n'*, 
gToolName); 

return -1; 

> 



sortedExportSynbols = (LoaderExportPtr) 

Halloc ( size » sizeof (LoaderExport) * IdHdr. nurCxportSyms ); 
if (I sortedExportSymbols) goto Hein_Error; 
cLearbytes ( (byte *) sortedExportS^nbols, size ); 

sortedKashChain « (HashChainEntryPtr) 

Malloc ( size s sfzeof (HashChainEntry) • IdHdr. nunExportSyms ); 
if (! sortedHashChaIn) goto Hem_Error; 
clearbytes ( (byte *) sortedHashChain^ size ); 

sortedExportNext = 0; 

// convert exportHashSlot in place, 

// put map of old --> new into indexMap 

for (s = 0; s < hashSlotSize; s++) < 

oldlndex - exportHashSlot [s]. chainlndex; 

exportHashSlot [s]. chainlndex = sortedExportNext; 

for (i s 0; i < exportHashSlot [s] . chainCount; i+*) C 

sortedExportSyn^ols [sortedExportNext] = exportSyniMls [oldlndex]; 

sortedHashChain [sortedExportNext] s exportHashChain [oldlndex]; 

sorted£xportMext++; 

oldlndex = exportLinkSlot [oldlndex]; 

> 

) 

Free ( (byte •) exportLinkSlot ); /• llwow, we alnnst care about nemory disposal! I •/ 
return 0; 

Mem_Error: 

printf (•«# Xs: Could not allocate Xd bytesW*, gToolName, size); 
return \; 
y /* SymSort () */ 



int AddUeaklmport ( byte ^libName, byte •name ) 

Weaklmport *ptr = (Weaklnport •) NewPtr ( sizeof (Weaktnport) ); 

if (! ptr) return -1; 

ptr -> libName » libNane; 

ptr •> name = name; 

ptr -> next = UeaklmportList; 

UeakliifwrtList = ptr; 
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return 0; 
} /* AddUeaklnport (> •/ 



5 int CheckUesIc ( void ) 
C 

Ueaklfflport *ptr; 
long start, end; 
long fUelndex; 
10 int UbNaneLen; 

Int InpNameLen; 



15 



for (ptr = UeaklmportList; ptr; ptr « ptr -> next) { 

start «• end = 0; 

tibNanwLen s strlen ( (char *) ptr •> libMame ); 
for ( fUelndex = 0; filelndex < IdHdr. numlmport Files; fflelndex-M^ ) ( 



ImpNaneLen - strten ( (char *> (stringTable *■ ImportFUes [filelndex]. 
20 fileNaneOffset) }; 

if ( inpNamelen t« UbNameLen |] 

I comparebytes ( stringTable importFiles [filelndex]. f i lellaneOffset, 
ptr -> libName, 
UbNameLen ) ) continue; 

25 

start - importFiles (filelndex!. infjFirst; 

end - start + importFiles (filelndex]. numlnports; 

for <; start < end; start«+ ) < 

if ( strcmp ( (char*} (stringTable + importSynbols [start]. naneOffset), 
30 (char*) ptr -> name) 1= 0 > 

continue; 

importsymbols [start}. symClass |s kUndef ImpOK; 
// we found the s^Aiibol whose name matches, so stopi 
break; 

35 > 

// we found the file whose library nanw matches, so stopI 
break; 

40 if (start >= end) 

printf ("# Xs: warning, symbol 'Xs' in file 'Xs' not found\n", gToolName, ptr •> 
name, ptr •> UbName); 
) 

45 for ( filelndex = 0; filelndex < IdHdr. numlmportFiles; filelndex++ ) C 

If ( importFiles [filelndex]. initBefore & kWeaklmport ) 
ChecklnportsForUndefOK ( filelndex ); 

> 

50 return 0; 

> /* CheckUeak () */ 

void checklmportsForUndefOIC ( int filelndex ) 
long start, end; 

start ° importFiles [filelndex]. impFirst; 
end = start * importFiles [filelndex]. nurolmports; 
60 for (; start < end; start+i^ ) ( 

if ( importsymbols (start). symClass & kUndeflmpOK ) continue; 
printf Xs: warning, syiibol 'Xs* is not narked as weak, but Its library \**Xs\" 
is.\n", 

gToolName, 

65 StringTable * importsymbols [start]. nameOffset, 
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stringTable ♦ importFiles [fUetndexl. f i leNameOff set ); 

> 

> /• ChecklirportsForUndefCMC <) */ 



long RelocAddr ( TLdRelPtr reloc ) 

long relpos = (long) reloc -> IrVAddr; 

10 if (reloc -> IrRSecNm == secNum (DATAINDEXl) i 

relpos dstaVAddr; 

> 

else if (reloc -> IrRSecNm » secMum [BSS INDEX]) < 
relpos •» bssVAddr; 
15 relpos datalen; 

> 



20 



25 



30 



return relpos; 
> /* RelocAddr (> •/ 



long Reloclndex ( TLdRetPtr reloc ) 
{ 

long r Index = reloc -> IrSynNdx; 

if (rindex <» BSStNDEX) 
rindex = DATAINDEX; 
return rindex; 
) /• Reloclndex () */ 



int DataAdjust c TLdRelPtr reloc, long count > 
< 

while (count > 0} < 
35 long rindex = reloc -> IrSynNdx; 

long •rAddr; 

if (reloc -> IrRSecNn fo secNum [MTAIMDEX] U 
reloc -> IrRSecNm i« secNun [BSSINDEX]) <. 
40 printf i"1t Xsi not converting relocation; relocations not allowed in eodeVi", 

gToolNanie); 

return 1; 

> 

45 if (rindex = CODEINDEX) { 

if (reloc •> IrRSecNm == secNun [BSSINDEX]) < 
if (I optBSSExpand) { . 
ernnsg : printf ("# Xs: relocation of BSS to BSS not convertable into PEF format without 
-b optionVn", gToolName); 
50 printf (•># item at OxXOSlx in BSS\n», reloc •> IrVAddr); 

printf («# relocate item Into (initialized) DATA to properly 

convert\n"); 

return 1; 

55 rAddr « (long *) (bssStart * RelocAddr ( reloc ) - dataLen); 

> 

else < 

rAddr s (long •) (dataStart ♦ RelocAddr ( reloc ) ); 

60 "rAddr -= codeVAddr; 

•rAddr += peff CodeVAddr; 

> 

else if (rindex == DATAINDEX) < 

if (reloc •> IrRSecNm s» secNun (BSSINDEX]) { 
65 if (I optBSSExpand) goto errmsg; 
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rAddr » (long *) (bssStart * RelocAddr ( reloc ) - dataLen); 

) 

else < 

rAddr = (loog •) (dataStart ♦ RelocAddr ( reloc ) ); 

5 ) 

*rAddr -= dataVAddr; 
•rAddr peff DataVAddr; 

> 

else if (rlndex == BSSINDEX) < 
10 rlrtdex = OATAINDEX; 

If (reloc •> IrRSecNm == secNun [BSSINDEX]) < 
if (! optBSSExpand) goto errmsg; 

rAddr « (long *) (bssStart ♦ RelocAddr ( reloc } - dataLen); 

15 else < 

rAddr = (long *) (dataStart ♦ RelocAddr ( reloc > ); 

> 

•rAddr -= bssVAddr; 
* rAddr *s dataLen; 
20 * rAddr peff DataVAddr; 

> 

count - - ; 
reloc**; 

25 return 0; 

> /* DataAdjust () V 



long fndex = 0; 

30 

#defi'ne kRptBufSize 8 // keep power of 2 

^define kRptBufHask (kRptBufSize-1) 
typedef Reloclnstr TRptSuf [kRptBufSize] ; 
TRptBuf rptBuf; // last n Mords of relocation 

35 int rptDist; // match found hoH far back? 

fnt rptRest; // nunber of fnstrs in current partial match 

int rptRunCnt; // nunber of conplete matches 



40 void PrtReloc ( Reloclnstr w1 ) 

{ 

relocations [relUordCnt**] = Ml; 
> /* PrtReloc () */ 

45 

CheckRelocNatch ( Reloclnstr w1, Int matchstart ) 
< 

int i; 

for (i = matchstart; i <= kRptBufSize; i++) ( 
50 if (M1 = rptBuf ( relWordCnt-i & kRptBufHask )) < 

rptDist = i; 

rptRest =1; // first word of match 

break; 



> /* CheckRelocNatch () •/ 



void RestartMatch ( Reloclnstr u1 ) 
60 { 

int relCopy = relWordCnt; 
int rptRunCopy = rptRunCnt; 
int rptRestCopv = rptRest; 
int rptOistCopy * rptDist; 
65 TRptBuf rptCopy; 
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// don't emit a RPT 1,1 
if <rptRunCnt == 1 && rptDist == 1) { 
rptRunCopy = rptRunCnt = O; 
5 rptRestCopy - rptRest = 1; 

> 

// output completed runs; set rptRunCnt to 0 
if (rptRunCnt > 0) { 
10 if (rptRunCnt <> ksRPTHAX) { 

PrtReloc ( (krRPT « RELOPSHFT) ♦ (rptO!st-1«8) ♦ rptRunCnt - 1 ); 

> 

else < 

PrtReloc ( (krLRPT « RELOPSHFT) + (rpt0ist-1«6) ♦ (rptRunCnt » 16) ); 
15 PrtReloc < rptRunCnt & 65535 >; 

> 

} 

// In case of an incomplete run, whUh ue will want to add to thru 
20 // repeat buffer, save the repeat buffer to knou uhat the partial 

// sequence was 

movefaytes ( (byte •) rptBuf , (byte *) rptCopy, sizeof rptCopy ); 

// if a run was sent, then clear the match buffer 
25 // otherwise, don't 

if < rptRunCopy) C 

clearbytes ( (byte *) rptBuf, sizeof rptBuf ); 

rptDistCopy a 1; //and set the distance of first ok match to 1 

// (no match will be found, however, AddReloc 
30 // will know that a sequence is being restarted 

// when ft sees that matchStart > 1) 

> 

rclC^ -= rptDist; 
35 rptRunCnt = 0; 

rptDist = 0; 
rptRest = 0; 

Hhile (--rptRestCopy >= 0) < // output incomplete match 

AddReloc ( rptCopy I relCopy ft kRptBufNask ], rptOistCopy^l ); 
40 rptDistCopy » 0; 

relCopy**; 

> 

AddReloc ( w1, 1 ); 
> /* RestartNatch <) */ 

45 

void AddReloc ( Reloclnstr w1, int matchStart ) 
static int secondUord; 

if (rptDist) < // is repeat buffer started? 

// yes- check if next word continues repeat 
if (w1 && w1 ~ rptBuf t relWordCnt-rptOist+rptRest & kRptBufMask ]) { 
_^ rptRest**; // yes- check if match complete? 

55 If (rptRest rptDist) < 

rptRunCnt**; // yes- incremnt run count 

^ -rptRest = 0; // clear count of uncompleted run 

60 else < // no- match continuation failed 

RestartNatch ( w1 }; 

} 

return? 

65 ' 



50 
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// case of no partial match started yet 

< //no - check for match anywhere 

if (matchStart > 1) secondUord - 0; // sequence is being restarted by Restart 



boundary 



// restarted sequence must start on instr 



// because match started on Instr boundfitry 
if (I secondUord && u1) // only check if start of instr 

CheckRelocNatch ( w1, matchStart ); 
. _ (rptDist) i // matched soneuhere? 

10 H (rptRest = rptOfst) { // yes- is match conplete? 

rptRunCnt**; // yes- increment run cnt 

rptRest = 0; // set rest cnt to 0 

) 

15 else if (Ml) { // no • add to match buffer, output 

rptBuf [ relUordCnt & kRptBufMask] = h1; 
PrtReloc ( Hi >; 

secondUord = < secondUord ? 0 : IsLARG < wl » RELOPSHFT ) ); 

20 > ^ 

) /• AddReloc <> •/ 

void dorform ( int op, long run, long max ) 
int cnt; 

Hhfle (run > 0) < 

cnt =■ run > max ? max : run; 
30 Addleloc < (op « RELOPSHFT) ♦ cnt - 1, 1 >; 

run cnt; 

> 

> /* dorform () •/ 



void doform ( int op, long cnt ) 
€ 

AddReloc ( {op « RELOPSHFT) ♦ cnt, 1 ); 
> /• doform <> •/ 



void dolform { int op, long offset ) 
{ 

AddReloc ( (op « RELOPSHFT) ♦ ((offset » 16) & 1023), 1 ); 
45 AddReloc ( offset & 65535, 1 ); 

> /• dolform (> •/ 

void dodform ( int op, Int delta, int run ) 
50 ( 

AddReloc ( (op « RELOPSHFT) ♦ (delta « 4) ♦ run, 1 ); 

> /* dodform (> •/ 

55 void dodelta ( long relpos, long delta ) 
< 

ff (! delta) return; 

if (delta >s 0 M delta ksOELTA) 

doform ( krOELT, delta-1 ); 
60 else 

dolform ( krUBS, relpos ); 

> /• dodelta () V 

65 Int RelCvt ( TLdRelPtr reloc, long count ) 
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long curpos = 0; 
long re I pes; 
long delta; 
5 long run; 

long rindex; 

long symlndex ° 0; 

long peff Index; 

10 DstaAdJust ( reloc, court ); 

relocations - (Reloelnstr *) Malloc ( count * sfzeof (Reloclnstr) * 2 * 2 ••■ 8 ); 
// max possible size of relocation: old count * 2 bytes per reloclnstr 
// * 2 for long form of reloc instr • 2 for two instrs 



15 



if (! relocations) return -1; 



if ( count > 0 ) ( 

if (codeSectNum >= 0) 
20 if ( codeSectMim 0} 

doform ( krCOIS, codeSectHun ); 

if (dataSectNun >= 0) 
if ( dataSectNun 1 ) 
25 doform ( krOTIS. dataSectNun ); 

> 

whi le (count > 0) { 

relpos s RelocAddr ( reloc }; // address to relocate 

30 delta » relpos ♦ curpos; // delta from last address 

curpos 3 relpos; // new curpos is addr to relocate 

run = 1; 

35 // look for run of 

// SYM,SYM*1,SyH+2... 

I rIndex = Reloclndex ( reloc ); 

40 if (rindex > BSSINDEX) < 

peff Index « xCOFFSynlndexToPeff IirportNunber (rindex - kSpecialSynbol Count]; 
if (peff Index == symlndex) { 
while (count 2 && 

RelocAddr ( reloc+l ) == curpos+4 && 
45 XCOFFSymlndexToPefflmportNumber [Reloclndex ( reloc-»1 ) - 

kspec i a I Symbol Count ] 

= symlndex+1) { 

curpos ♦= 4; 
reloc**; 

50 symlndex-^; 

run**; 
count- •; 

} 

> 

55 

dodelta ( relpos. delta ); 

if (run > 1 11 peff Index ~ symlndex) { 

dorform ( krSYHR. run. ksISNAX ); 

symlndex*^; 

60 > 

else ( 

if (pefflndex >« ksiSMAX) 

dolform ( krLSYM, pefflndex ); 
else 

65 doform ( krSYMB, pefflndex ); 
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symlndex = peff Index-tl; 

> 

curpos 4; 
reloc**; 
5 index += run; 

count--; 
continue; 

> 

10 // look for run of: 
// code, 
// data 

while (comt >= 2 && 
15 RelocAddr ( reloc-M ) = curpos+4 && 

Reloclndex ( reloc+1 ) == rindex) { 
curpos +=4; 
reloc**; 
run**: 

20 count--; 

> 

// no single run found, 
// look for run of: 
25 // desc(code/data/blank) 
// vtbKdata/blank) 

ff (run == 1 && rindex == 0) { // desc <12 byte fonn) 

MhUe (count >« 2 && 
30 RelocAddr ( reloc } curpos ft& 

Reloclndex ( reloc > == CODEINDEX && 
RelocAddr ( reloc+1 ) == curpos^4 && 
Reloclndex ( reloc+1 > DATAINDEX && 
<count " 2 Jl RelocAddr ( reloc-(>2 ) >a curpoS'^12) > < 
35 curpos 12; 

reloc ♦= 2; 
run^; 
count -= 2; 

40 If (rim > 1) < 

run--; 

dodelta ( relpos, delta }; 
dorforro ( krOESC, run, ksOEMAX ); 
index ♦= run*2; 
45 continue; 
> 

while (count >= 2 && // desc (8 byte form) 

RelocAddr ( reloc ) — curpos ft& 
Reloclndex ( reloc ) == CODEINDEX && 
^0 RelocAddr ( reloc-fl ) ■* curposH U 

Reloclndex ( reloc^l ) DATAINDEX) C 
If (count > 2> < // try to exclude 12 byte form 

long addr » RelocAddr ( reloc-*-2 ); 
if (addr < curpos+8 {| addr s= curpos+12) break; 

35 > 

curpos ♦= 8; 
reloc *= 2; 
run^; 
count -« 2; 

60 > 

if (run > 1) { 
run- • ; 

dodelta < relpos, delta ); 
^ dorform ( krOSCZ, run, ksDEHAX ); 

o5 index ♦= run*2; 
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continue; 

> 

) 

else if (run i && rindex ss 1) { // vtbl 

5 ^ while <count >= 2 && 

RelocAddr ( reloc+1 ) = curpos*8 && 
Reloclndex ( reloc+1 ) == DATAINDEX && 
(count » 2 ;; RelocAddr ( reloc+2 ) >& curpos+ld) ) < 
curpos *B 8; 
10 reloc**; 

run^; 
count--; 

) 

if (run > 1) ( 

15 /* 

if (run <= ksOVRHAX) ( 

if (detta > ksOVDHAX (delta & 3)} C 
dodelta ( relpos, delta }; 
delta « 0; 

20 } 

dodform ( krDVBL, delta « 2, rtn-l ); 

> 

else 

25 *' 

dodelta ( relpos, delta ); 
dorform ( krVTBL, run, ksVTMAX ); 

> 

curpos ♦= 8; 
30 reloe^; 

index ♦= run; 

count--; 

continue; 

35 

if ( rindex == DATAINDEX run <= ksDDRNAX } i 
if (delta > ksDDDNAX |[ (delta & 3)) C 
dodelta ( relpos, delta ); 
40 delta « 0; 

> 

dodform ( krODAT, delta, run }; 

> 

else { 

45 dodelta ( relpos, delta ); 

dorfono ( rindex 0 7 krCOOE : krOATA, run, ksDATA ); 

) 

curpos 4; 
reloc**; 
50 index *s run; 

count-*; 

> 

// flush rpt buffers ■ with the unmatchable and reserved instruction opcode. 
55 AddReloc ( 0. 1 ); 



60 



return 0; 
> /• RelCvt () •/ 



int FindStuff ( byte *theContainer ) 

fileHdrPtr = (TFileHdrPtr)theConteiner; 
auxHdrPtr = (TAuxHdrPtr)(f i leHdrPtr ♦ 1); 
65 scnHdrs « (TScnHdrPtr)(euxHdrPtr +1); 
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10 



IdrScnHdr s & scnHdrs CauxHdrPtr -> ahSnLoader-l] ; 

IdHdrPtr = (TLdHdrPtr) {theContainer ♦ IdrScnHdr -> shScnPtr); 

IdSyms = (TLdSyn#»tr> (IdHdrPtr + 1); 

IdRelocs = ^TLdRelPtr) (IdSytns ♦ ldHdrPtr->lhNSyms); 

IdStrlngs = (byte *) IdHdrPtr ♦ IdHdrPtr- >lhSTOff; 

secNum [01 = auxHdrPtr -> ahSnText; 

secNun CI] » suxHdrPtr -> ahSrOata; 

secNun Q] s auxHdrPtr •> ahSnBSS; 



15 



20 



25 



30 



35 



40 



45 



codeScnHdr = & scnHdrs [ secNun (0) • 1 ]; 

codestart = theContainer + codeScnHdr -> shScnPtr; 

codeLen = codeScnHdr •> shsize; 

codeVAddr = (long) codeScnHdr -> shVAddr; 

dataScnHdr - & scnHdrs [ secNum [1] - 1 ]; 

dataStart a theContafner ♦ dataScnHdr -> shScnPtr; 

dataLen « dataScnHdr •> shSUe; 

dataVAddr - (long) dataScnHdr •> shVAddr; 

bssScnHdr = & scnHdrs [ secNun (2} • 1 ]; 
bssLen = bssScnHdr -> shSize; 

bssVAddr = (long) bssScnHdr -> shVAddr; 

if (optBSSExpand) < 

bssStart s Ma Hoc ( bssLen ); 

cleaHaytes ( bssStart, bssLen ); 

J 

******************«****««***********«*«*«««*«*****«**********«** 

ff (auxHdrPtr •> ahSnText) 
( 

= & scnHdrs C secMun 10) -11; 
= theContainer ♦ codeScnHdr -> shSenPtr; 

- codeScnHdr -> shSize; 

- (long) codeScnHdr •> shVAddr; 



I******************/ 



codeScr^dr 
codestart 
codeLen 
codeVAddr 

) 

else 

C 

codeScnHdr 
codestart 
codeLen 
codeVAddr 

> 



= 0; 
» 0; 
« 0; 
= OL; 



if (auxHdrPtr -> ahSnOata) 



50 



55 



60 



dataScnHdr 
dataStart 
dataLen 
dataVAddr 

) 

else 
< 

dataScnHdr 
dataStart 
dataLen 
dataVAddr 



B & scnHdrs C secNum [1] - 1 1; 

s theContainer * dataScnHdr •> shScnPtr; 

" dataScnHdr -> shSize; 

> (long) dataScnHdr •> shVAddr; 



= 0; 
= 0; 
" 0; 
= OL; 



65 



if (auxHdrPtr -> ahSnBSS) 

( 



bssScnHdr 
bssLen 



= & scnHdrs ( secNun 121 ■ 1 J; 
= bssScnHdr ■> shSize; 
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bssVAddr = (long) bssScnHdr •> shVAddr; 

if (optBSSExpand) 

< 

bssStart Halloc ( bssLen ); 
5 clearbytes ( bssStart, bssLen ); 

) 

else 

< 

10 bssSoiHdr B 0; 

bssLen s 0; 

bssVAddr s QL; 

if (optBSSExpand) 

15 bssStart = (byte*)0; 

> 

) 

20 locNun tO] ' 0; 

locNim [1] B 0; 

locNum [2] B scnNdrs [auxHdrPtr -> ahSnData • 1]. shSize; 

symCnt = IdHdrPtr •> IhMSyms; 
25 relent = IdHdrPtr -> IhNReloc; 

return 0; 

> /• FindStuff () V 

30 

#if 0 

long FindExportSeetienNunber ( long exportNisnber > 
long seen; 

35 //--return exportSynbols CexportNuiter) . sectionNunber; 

for (seen = kReExport Import; seen < kHaxLoadabteSecttons; secrH-^) { 
if (exportNuit>er >s loaderHdr Csecn] . expFirst && 

exportNurter < loaderHdr [secnj. expFlrst ♦ loaderHdr [seen]. nuoExports) 
4U return seen,- 

> 

return -1; 

> /* FindExportSectionNunber () */ 
45 0end1f 

«if 0 

int FindSymbol { byte *name, long *secn, long *offset ) 

Int f; 

int length; 

•seen « -1; 
55 *offset = - I; 

if (! name) return 0; 

length = strlen { (char*) name ); 

60 for (i s 0; i < IdHdr. minExportSyns; i++) { 

if ( (exportSymbols [1). hashuord » 16) I* length) continue; 

if ( carparebytes ( exportSymbols [il. nameOffset ♦ stringTable, name, length ) ) { 
*secn B FindExportSectionHusber ( i ); 
- •offset = exportSynbols [1], address; 

o5 return i; 
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20 



30 



40 
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> 

> 

firintf ("# %s: SynAxil Xs not foundVn". gToolNane. name); 

return -1; 
> /* FindSymbol <) */ 
#endff 



long FindXCOFFIndexByName ( byte *naine, TLdSymPtr IdSyms, long symCnt ) 

<. 

TLdSymPtr sym; 
long symlndex; 
15 int nameLength; 

int xnamlen; 
byte *xnaffl; 



nameLength " strlen ( (char *) name ); 



for (symlndex * 0; symlndex < symCnt; symlndex-M^) { 
sym s & idSyiRS [symlndex] ; 

If ( I IsValfdloport ( sym > && ! IsValfdExport ( sym ) } continue; 
GetName ( sym, & xnam, & xnanLen ); 
25 if ( xnamLen -- nameLength && 

conparebytes ( xnam, name, nameLength ) > return symlndex; 

> 



return -1; 
> /* FfndXCOFFIndexByNaine () */ 



fnt CheckLlbOpt ( byte **namep, int ViameLengthp, TVerston *linkedVer, TVersfon *oldIinpVer, 
35 TBoolean *iB ) 
( 

int i; 

byte •libStr; 
Int nl; v 



•Unkedver = koef Version; 
*oldIinpVer « kOef Version; 
*iB = 0; 



45 for (1 = 0; i < IfbOptsr i++> ( 

UbStr = liblnfo [f3; 
nl = mchlen ( libStr, (byte *) "=#-!-•» >; 

if ( nl =r *nameLengthp U conparebytes ( IfbStr, ^namep, nl > } < 

for (;;) { 

50 switch ( libStr [nl] ) < 

case '\0': break; 
case : libStr *- nl ♦ 1; 

nl = mchlen ( HbStr, (byte •) "»#•!-" ); 
•nam^ = libStr; 
55 *nameLengthp b nl; 

continue; 

case : *oldlirpVer = *l{nkedver - Getint ( libStr <«■ nl ♦ 1, ft llbStr, 0 ); 
nl s 0; continue; 

case : *oldlmpVer » Getint ( libStr ♦ nl ♦ 1. ft libStr, 0 ); nl - 0; 

60 continue; 

// case ')' : nl**; continue; 

case : nl**; *iB |a klnitBefore; continue; 
case : nl*+; •iB |a kUeaklnnport; continue; 

65 break; 
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> 

break; 

> 

} 

return 0; 
> /* CheckLlbOpt () V 



tnt ProcessExportLfst ( void ) 
10 ( 

int err; 

byte •buf, *ptr, *end; 
long I en; 
long index; 



15 



20 



err - Openftead ( expFi leName. ft buf, & len ); 
if (err) return err; 

end s buf <^ len; 

for i ptr s buf; ptr < end; > { 
if (•ptr** « »\n') addExpCnt**; 

> 

25 addExpAry = (AddExpType *) Malloc ( addExpCnt * sizeof (AddExpType) ); 

if (! addExpAry) { 

printf (•'# Xs: Could not allocate %d bytesXn", gToolName, addExpCnt * sizeof 
(AddExpType)); 
return -1; 

30 > 

index = 0; 

for ( ptr = buf; ptr < end; ) ( 

while (ptr < end && <*ptr == • ' |{ •ptr « '\t')) ptr**; 
35 if < *ptr = '3' ) { 

ptr++; 

if ( ptr >= end ) goto ExpErr; 

// format aoldname neuname 
addExpAry tindexl. fomiat = {(Equivalence; 
40 addExpAry (Index], oldname = ptr; 

uhile (ptr < end && 

(•ptr !■ • ' M •ptr f= »\V U ♦ptr !» »\r» U *ptr !« '\n')> ptr++; 
if (ptr >B end) goto ExpErr; 
*ptr** = 0; 

45 > 

else ( 

addExpAry (index], fortnat = kAbsolute; 
// format sec#:offset class newname 
BddExpAry (index), seen = Getint ( ptr, & ptr, end ); 
50 if (*ptr** !« j| ptr > end) goto ExpErr; 

addExpAry tindexl . offset « Getint ( ptr, & ptr, end >; 
while (ptr < end M (•ptr = » ' |j *ptr == '\t')) ptr**; 
if (ptr >e end) goto ExpErr; 

addExpAry [index], class s Getint ( ptr, & ptr, end); 

55 > 

while (ptr < end (♦ptr = ' ' }j •ptr == '\t')) ptr++; 
if (ptr >= end) goto ExpErr; 
addExpAry (index] . name -ptr; 

while (ptr < end && ♦ptr !» '\r' && *ptr l» '\n') ptr**; 
60 if (ptr >- end) goto ExpErr; 

♦ptr** s 0; 
index**; 

) 

65 //-- don't unriap, its in use! ReleaseFile ( buf, len ); 
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return 0; 
ExpErr ; 

printf ("# Xs: error in export list, line Xd\n«. gToolName. index-1>; 
5 return -1; 

> /• ProcessExportList <) */ 

. int CheckXCOfF ( byte *theContainer ) 
10 i 

TFfleHdrPtr fileHdrPtr = (Tf i leHdrPtr)theContainer; 

TAuxHdrPtP auxHdrPtr « (TAuxHdrPtrXf ileHdrPtr ♦ 1); 

/* Verify that the container Is valid XCOFF. The "file" header must have */ 
15 /* the right magic number, the right size for the auxiliary header, end •/ • 

/* executable and dynamic load flags must be set. The auxiliary header */ 
/* Bust have the right version nutrber and non-zero section numbers for ♦/ 
/* the .text, .data, .bss, and .loader sections. 
V 

20 /• 

if (fileHdrPtr->fhMagic != AIXPo«erPCMagic •} 
fileHdrPtr->fhOptHdr 1= sizeofCTAuxKdr) 11 
! IsExec(MileHdrPtr) |J 

25 I IsOynLoad(*fileHdrPtr) ) return badUnitErr; 

if (auxHdrPtr->ahVStairp !» kXCOFFVersion || 

8uxHdrPtr->ahSnText «= 0 || 

30 auxHdrPtroahSnData s= 0 ; J 

auxHdrPtr->ahSr«SS =° 0 || 

auxKdrPtr->ahSnLoader «: 0 > return badUnitErr; 
*******•***********•*•*********, 



50 



35 if (a*ixHdrPtr*>ahVStainp 1= kXCOFFVersion j| 

auxHdrPtr->ah$nL08der — 0 ) return badUnitErr; 

return 0; 
40 > /* CheckXCOFF () */ 

int OpenRead ( byte •fMame, byte "buffer, long *length ) 

45 short refMuro; 

byte *ptr; 
long len; 
long err; 
StrZSS pName; 



err ■ HakePStr ( fNane, strlen ( (char*) fName ), pName, sizeof pNaine ); 
if (err) return err; 



err = KOperDF ( 0, 0, pName, fsRdPenn, & refNun ); 
55 If (err) { printf ("# Xs: Could not open input file Xs »d\n", gToolName, fName, err); 

return -1; > » » #» 

err a GetEOF ( refNun, & len ); 

if (err) C FSClose ( refNum ); printf ("# Xs: Could not seek In Xs Xd\n". gToolName. 
50 fName, err); return -1; > ' 



65 



ptr ■ (byte •> NeuPtr ( len ); 

if (I ptr) c FSClose ( refNun >; printf C*» Xs: Could not allocate Xd bytesVn". 
gToolName, len); return -1; > 
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err = FSRead ( refNutn, & len, ptr ); 

if (err) { FSClose ( refNum ); printf (•*« Xs: Could not read Xs XdVn", gToolNatne, fNa 
err); return -1; > 

5 FSClose ( refNun }; 

•buffer = ptr; 
•length ■ len; 

10 return 0; 

> /• 0per«e8d () V 

void fflovebytes ( byte *s, byte •t, long length ) 
15 { 

Hhile (--length >a O) *t** s *s*+; 
) /* nnvebytes () */ 

20 void clearbytes ( byte *t, long length ) 

Mhile (--length 0) •t** « 0; 

> /* clearbytes (> */ 

25 

int conparebytes ( byte *s, byte *t, int length ) 
< 

while (--length >- 0) If (*s** !» *t**) return 0; 
return 1; 
30 y /* conparebytes <) V 

byte mem [11; 

byte •Malloc ( long len ) 
35 { 

extern byte *malloc ( long len }; 

if (I len) return mem; 
return (byte*) (maUoc ( len »; 
40 ) /• Halloc () V 

void Free ( byte *tofree ) 

45 ^pragma unused(tofree) 

extern void free < byte * >; 

/* ha! don*t free anything, 

since we are almost never doing so anyway, 
50 and ue are mixing NewPtr with Natloc (yuk) 

if ( tofree Is mem > 
free (tofree); 

*/ 

> /* Free () */ 

55 

long Write ( int fd, byte *buf, long len ) 
€ 

int err; 

if (• len) return 0; 
60 err > FSUrite ( fd. & len. buf ); 

if (err) return err; 
return len; 

> /• Write () V 

65 
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long Getint ( byte •ptr, b/te •*out, byte *end ) 
{ 

Boolean neg = 0; 
long value s O; , 
5 fnt mp s 10; 

int off; 

if (I end) end « ptr « 16; 

uhfle <ptr < end ft& ("ptr ' ' || •ptr ■= '\t')) ptr++; 
10 if (*ptr M '-') { neg » 1; ptr**; > 

if (•ptr «■ '0') < 
ptr++; 

if (*ptr »x'> { ptr++; mp » 16; 3 
else < np s 8; > 

15 > 

while <ptr < end && *ptr) < 

if (*ptr >s »0' && *ptr <= '9') off = '0'; 
else if (*ptr >= 'a' U •ptr <» 'f) off ■ 'a' • 10; 
20 else if (•ptr >■ 'A» && •ptr <» 'F') off = 'A' - 10; 

else break; 
value *■ mp; 
value +0 •ptr+4- • off; 

25 if (out) 

•out a ptr; 
return neg 7 -value : value; 

> /• Getint () */ 

30 

int mchlen ( b/te •str, byte *iBch ) 
{ 

int len = 0; 
byte •mptr; 
35 byte ch; 

for (;;> { 
ch = •str**; 
if (I ch) return len; 
40 nptr ■ mch; 

uhile (*inptr) if <*i»ptr++ =» ch) return len; 
len**; 

> 

> /* mchlen () */ 



45 



50 



55 



60 



int LogZ ( unsigned long aln ) 
{ 

int i; 
aln--; 

for (i = 0; aln; i**) aln »= 1; 
return i; 
y /* LogZ () */ 



OSErr NakePStr ( byte •src, int len, byte ♦trg, int max ) 

if <len >= max) return ioErr; 

novebytes ( src, trg*1, len ); 
*trg a (byte) len; 



return noErr; 
65 > /* MakePStr () */ 
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50 



/* UMIX -> MAC date convert. Runs on UNIX only */ 
#include <time.h> 
long HscTime(long unixTime) 
static time_t convert = 0; 
if ( unixTime 0 ) return 0; 



if (! convert) 

i 

15 tiine_t gmt = time(O); 

struct tm local = *localti[ne(&9mt}; 

convert » mktimet&local) • aiktime(gn)time(&gint)); 
ifdocal.tm Isdst) 
20 convert *« 60 * 60; 

convert 0x7e25b080; /* magic nunber for 1/1/70 in mac time */ 
) 

25 return unixTime ■•- convert; 



30 void eonvertStringZLong (byte* theString, unsigned long* theLong) 

*theLong = ((longXtheStringCO] « 24) ♦ (longXtheStringCH « 16) ♦ (long)(theString(2] 
« 8) ♦ <lono)theStringt3)>: 

35 ' 

#if MAC 

long GetFilelime ( byte *fileNanie ) 
{ 

40 ClnfoPBRec pb; 

OSErr err; 
Str255 pName; 

err = HakePStr < fileName, strlen < (char*) fiteNane ), pNane. sizeof pName ); 
45 if (err) return 0; 

clearbytes ( (byte *) & pb, sizeof pb ); 

pb. hFilelnfo. ioNamePtr s pName; 

err s PBGetCatlnfo ( ft pb, FALSE ); 
ff ( err ) return 0; 



return pb. hFilelnfo. loFlMdDat; 
55 > /• GetFtleTfme (> •/ 
«endif 



60 



int CountCorapareBytes ( register byte *s, register byte -t, register int length ) 
65 int fullLength = length; 
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while {--length >= 0> if <*s++ |o n**) return ful ILength- length- 1; 
return ful (Length; 

> /* CountCofflpareBytes (} */ 

5 

long CountZeroBytes ( byte 'src,. long length } 
{ 

long cnt ■ 0; 

Hhfle ( —length >■ 0 > < 
10 If C •src'M. — 0 > cnt+*; 

> 

return cnt; 

> /* CountZeroBytes () •/ 



15 



45 



#define kHaxForwarcMatch 12 



typedef struct { 
in opcode; 
20 long matchlen; 

long diffcnt; 

long rptCnt; 

long backup; 

25 byte *ptr; 

byte •end; 

long coverage; 
long cost; 
30 long overhead; 

long gain; 
> Packlnstr, *PackInstrPtr; 



35 byte *gBlockStart; 
/* 

RBLK rptcnt, inatchCnt, diffCnt 

RBLK rptCnt=1, matchCnt=0, diffcnt=x ==> BLK x 

40 RBLK rptCntsx, natchCntsy, diffCntaO »=> RPT y,x 



byte *targetBuf, *targetEnd; 



OSErr EmItBytes ( byte *pc inter, long count ) 
{ 

register long cnt = count; 
50 register byte *ptr = pointer; 

fnt Ip - 0; 

if ( cnt <= 0 ) return noErr; 

55 if t targetBuf * cnt >« targetEnd > return -1; 

while ( --cnt >» 0 > { 
* targetBuf ♦+ = *ptr+*; 

60 ' 

«if DEBUG 

ptr = pointer; 

cnt » count; 
printf (" ">; 
Ob while ( --cnt >s 0 ) € 
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printf X02x", *ptr**); 
if ( **lp == 15 ) < 

printf ("\n"); 

printf (" , ••); 
5 Ip « 0; 

> 

> 

printf ("\n"); 

if ( debug ) fflush ( stdout ); 
10 #endif 



15 



25 



55 



return noErr; 
> /* EmitBytes () */ 



Packlnstr buflnst; 
Packlnstr neulnst; 



20 OSErr EoiitCount ( long count ) 

if ( count < 0 ) return noErr; 



if ( targetBuf >- targetEnd-5 ) return -1; 



if ( count <» 127 ) < 
* targetBuf +♦ = count; 

> 

else if ( count <■ (127«7) ♦ 127 ) < 
30 * targetBuf +'»^ ■ (count » 7> } 0x80; 

♦targetBuf** a count & 0x7f; 

> 

else if ( count <= ((127«7>+127«7) ♦ 127 ) ( 
*targetBuf++ = (count » U ) | 0x80; 
35 •targetBuf ++ = (count » 7) } 0x80; 

*targetBuf"M> = count & 0x7f; 

> 

else if ( count <= (((127«7)+127«7)*127«7) + 127 ) { 
* targetBuf +♦ ■ (count » 21 ) j 0x80; 
40 *targetBuf<M- s (count » 14 ) j 0x80; 

•targetBuf+4- s (count » 7> | 0x80; 
•targetBuf** « count & 0x7f; 

> 

else { 

45 •targetBuf » (count » 2S ) ; 0x80; 

•targetBuf** s (count »> 21 ) j 0x80; 
♦targetBuf +* a (count » 14 ) j 0x80; 
•targetBuf** » (couit » 7) | 0x80; 
•targetBuf** ■ count t 0x7f; 

50 ) 



return noErr; 
> /• EnitCount () •/ 



OSErr EmitOpcode ( byte op, long count } 
( 

if ( count « 0 > return noErr; 



60 if ( targetBuf >~ targetEnd ) return -1; 

if ( count <» kFirstOperancHask ) ( 

* targetBuf s (op « kOpcodeShif t) ♦ count; 
return noErr; 

65 > 
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•targetBuf** » (op « kOpcodeShift); 
return EnitCount ( count ); 
> /* EnitQpcode () *A 



OSErr EmitOnelnst ( byte ch, PacklnstrPtr ibp ) 

#prasma unusecKch) 
10 long cnt; 

byte *ptr; 
OSErr err; 



15 



if ( ibp -> opcode « kNoOpcode ) return noErr; 



#if DEBUG 

printf ("XcJJOAlx-mix", 
ch, 

Ibp -> ptr ■ gBlockStart, 
20 ibp -> end • gBlockStart - 1); 

#end{f 

switch ( fbp -> opcode ) ( 
case kZero : 

25 err B EmttOpcode ( kZero, ibp -> rptCnt ); 

if < err ) return err; 

// printf <" ZRO #J!d\n", ibp -> rptCnt ); 

mf DEBUG 

30 printf (" ZRO #Xd", ibp -> rptCnt ); 

printf <»\tv=Xd,c=Xd,o=Xd,9"31d\n'», 
fbp -> coverage, 
ibp •> cost, 
ibp -> overhead, 
35 ibp -> gain ); 



#endif 

case kBlock 



break; 



. err = EmitOpcode ( kBlock, fbp •> diffCnt ); 

40 if ( err } return err; 

// printf <" BLK iKd\n«, ibp -> dfffCnt >; 

«if DEBUG 

printf BLK jWd", ibp -> dfffCnt >; 
printf ("\tv»Xd,c=Xd,o«%d,g-Xd\n", 
45 ibp -> coverage, 

ibp -> cost, 

ibp •> overhead, 

fbp -> gain ); 

#endif 

50 err s EnftBytes < ibp ■> ptr, Ibp •> diffCnt ); 

if ( err ) return err; 
break; 

case kRepeat: 

- err = EmitOpcode ( kRepeat, ibp -> diffCnt ); 

3J> if < err > return err; 

printf (" RPT #%d*!«d\n", ibp -> tnatchLen, ibp -> diffCnt ); 

#lf DEBUG 

printf (" RPT nd*Jtd", ibp -> matchLen, ibp -> diffCnt ); 
printf ("\tv«xd,c=%d,o*Xd,g3!6d\n", 
oU ibp -> coverage, 

ibp •> cost, 

fbp -> overhead, 

ibp -> gain ); 

«endif 

03 err s EmitBytes ( ibp -> ptr, ibp -> matchLen ); 
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25 



30 



35 



40 



45 



50 



55 



60 



65 



if < err ) return err; 
break; 

case kRepeatBlock : 

if J ibp -> diffcnt == 0 ) C 

err - EmitOpcode ( kRepeat, ibp -> tnatchLen ); 
if ( err ) return err; 

> 

else < 

err - EmitOpcode ( kRepeatBlock, ibp •> tnatchLen >; 

if ( err ) return err; 

err « EmftCount ( ibp -> diffCnt ); 

if ( err ) return err; 

> 

err = EmitCount ( it^ -> rptCnt • 1 ); 
if ( err ) return err; 
printf (» RBLK «Xd,Xd)*Xd\n", ibp -> matchLen, ibp -> diffCnt, ibp -> 



// 

rptCnt ); 
Mil DEBUG 



20 rptCnt }; 



#endff 



prfntf (" RBLK «3£d,5td>*)Sd", ibp -> natchLen, ibp -> dtffCnt, ibp -> 

printf <"\tv=Xd,c«Xd,o«Xd,9-SW\n", 
ibp -> coverage, 

ibp -> cost, 
ibp •> overhead, 
ibp -> gain ); 



err - EmitBytes ( ibp -> ptr, ibp -> matchLen ); 
if ( err ) return err; 
cnt B ibp -> rptCnt - 1; 
ptr s ibp -> ptr * ibp -> matchLen; 
while < —cnt >" 0 ) c 

err = EraitBytes ( ptr, ibp •> diffCnt ); 

if < err ) return err; 

ptr ♦= ibp -> matchLen + ibp -> diffCnt; 

break; 
case kRepeatZero ; 

err s EmitOpcode ( kRepeatZero, ibp -> matchLen ); 
if < err ) return err; 
err « EmftCount < ibp -> diffCnt ); 
if ( err ) return err; 
err « EmitCount ( ibp -> rptCnt - 1 >; 
if ( err ) return err; 
printf (" RZRO #<Xd,Xd>*%d\n". ibp -> matchLen. ibp -> diffcnt, ibp -> 



// 

rptCnt >; 
#if DEBUG 



rptCnt ); 



#endif 



default 
«if DEBUG 



printf <" RZRO #(56d,Xd)*Xd", ibp -> matchLen, ibp -> diffCnt, ibp -> 

printf <"\tv=Xd.C8M,oaXd,Q=%d\n«, 
ibp •> coverage, 
ibp -> cost, 
ibp -> overhead, 
ibp •> gain ); 

cnt o ibp -> rptCnt - 1; 

ptr B ibp -> ptr + ibp -> matchLen; 

Khile ( --cnt 0 > < 

err ■ EmitBytes < ptr, ibp -> diffCnt ); 

if C err ) return err; 

ptr ibp -> matchLen ♦ ibp -> diffCnt; 

break; 



printf ("??? 5td\n", ibp -> opcode); 
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break; 

/* 

If < ch ==• ) 

gPackedSize ♦= ibp -> overhead ♦ ibp -> cost; 



ibp -> opcode ■ kNoQpcode; 
«if DEBUG 

if ( debug ) fflush ( stdout ); 

#endif 

return noErr; 
} /* EmftOnelnst () */ 



20 OSErr Emit Inst ( Packlnstrptr pi ) 
OSErr err; 

if ( pi -> opcode »« kNoOpcode ) return noErr; 
do err » EraitOnelnst ( 'a*, pi ); 

pf •> opcode - kNoOpcode; 
return err; 
> /* Enltlnst () •/ 



int Hatches ( register Ixfte •ptr, register byte •end, PacklnstrPtr bestlnstr > 



register long distance; 

register long matchLen; 
35 register long rptCnt; 

long bestKatchLen; 

long repeatedZeros; 

long zeroBytes; 

byte •match; 
40 byte found; 

byte tryShorter; 

byte ch; 

Packlnstr thislnstr; 

long backup; 
45 long backupCnt; 

long backi^JdptCnt; 

long fwd; 

long fwdCnt; 

long fwCCnt; 
50 long fwdRptCnt; 

long fwdSRptCnt; 

long thisFwd; 

long matchCount; 

55 clearbytcs ( (byte •> bestlnstr, sizeof *bestlnstr >; 

found = FALSE; 

bestNatchLen * 0; 

60 ch B •ptr; 

for ( distance = 1; distance <n kNaxForyarcMatch; distance*^ > { 

if < ptr ♦ distance >= end ) break; 

if < ch != ptr [distance] ) continue; 
- II found a potential match 

^5 for < netchLen « 1; matchLen < distance; matchLen** ) C 
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if ( ptr « distance * matchLen >- end ) break; 

if ( ptr [matchLen} !» ptr (distance * matchLen] ) break; 

> 

5 if ( matchLen <^ bestHatchLen ) continue; 

bestMatchLen b matchLen; 

Shorter: 

10 tryShorter e FALSE; 

repeatedZeros ■ Count ZeroBytes ( ptr, matchLen ); 
// if ( repeatedZeros == matchLen ) then ue know it is 
// a repeated zero 

15 // type of match 

thislnstr. diffCnt & distance - matchLen; 

if ( thislnstr. diffCnt » 0 && matchLen > 1 ) < 
20 // remember, diff count is zero 

// so, this is a repeat only 
if ( repeatedZeros == matchLen !| 
matchLen >& 5 && 

5 * repeatedZeros >= 3 • matchLen ) 
25 continue; 
> 

thislnstr. backup a O; 

backup 3 0; 
30 backupCnt » O; 

backi^ptcnt s 0; 

fwd ° 0; 

fwdCnt 8 0; 

fwdRptCnt = 0; 
35 fwdSCnt « 0; 

fwdJRptCnt a 0; 

match = ptr; 
zercBytes b O; 
40 for ( rptcnt = 1; ; rptcnt** > { 

match distance; 

if ( match * matchLen >s end ) break; 



45 



50 



matchCount » Count ConpareBytes ( ptr, match, matchLen ); 

if ( matchCount < matchLen } { 

if ( matchCount ) tryShorter = TRUE; 
break; 

> 



// match of at least one (ie: rpt of 2) 
// now, see if it is practical to 90 backward 
// to extend the match 
// or should this be an on going thing 
55 if ( backi^j == 0 ) < 

backupRptCnt » rptCnt; 
while ( backup < thislnstr. diffCnt M 
buflnst. opcode ss kZero ftft 
backup < buflnst. rptCnt M 
60 ptr [-backup-1] « match ([-backi^j-n ) { 

backup*^-*-; 

> 

} 

// count backup byte compares 
o5 else if ( comparebytes ( ptr-backup, match-backup, backup > ) { 
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backupCnt-*^; 

> 

else ( 

//we coyldr't continue the backup. 
5 // is it worth stopping the match just to allow the backup? 

// or, deleting the back up? 
if ( backupCnt > 3 ) < 
.y/ ft uas worth it : 
//so, either use the backup 
10 //or stop this repeat short of it 

if ( backupRptcnt ia 1 } 

rptcnt - backupRptcnt - 1; /* ! to try to catch the backup next time! •/ 
break; 

15 backup 3 0; 

backupCnt - 0; 
backupRptcnt - 0; 

> 

20 if ( thislnstr. diffCnt ) < 

// count forward conpares 

thisFwd = CountCompareBytes < match - thislnstr. dlffCnt. match ♦ iratchLen. 
thislnstr. diffCnt ); 
25 if < fwd == 0 ) { 

if < thisFwd ) < 
fwd B thisFwd; 
fwdCnt = 1; 

2^ fwdRptCnt " rptCnt-1; 

> 

else if ( thfsFud > < 

// some match larger than the current Instruction 
f MdCnt**; 

35 > 

else { 

// extra match count no longer present 
if ( fwdCnt >= matchLen ♦ 3 ) { 
// this match cnt is noteworthy: 
^0 // record repeat count of this match 

// to stop at, so that a resart can 
// catch this new longer natch 
if ( fwdSCnt =o 0 > fwdSRptCnt = fwdRptCnt; 
if ( ++fud3Cnt >o 2 ) < 
45 fwd o 0; 

rptCnt = fwdSRptCnt; 
break; 

> 

> 

50 fwd » 0; 

fwdCnt s 0; 
fwdRptCnt s 0; 

> 

55 zeroBytes CountZeroBytes < match - thislnstr. diffCnt /• matchLen •/ 

thislnstr. diffcnt ); * 
if ( repeatedZeros = matchLen ) < 

if ( zeroBytes « thislnstr. diffCnt \\ 

3 • zeroBytes >■ thislnstr. diffCnt • rptcnt ) break; 

else { 

«ff trim 

^ if < 5 ♦ zeroBytes >a 3 • thislnstr. diffCnt • rptCnt > break; 

ffendif 
65 > 
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> 



ii ( fwd && fwi^nt > 3 ) < 
5 // stopped in the middle of a match 

// that had an extended match 
rptCnt = fwdRptCnt; 

> 

10 ff < rptCnt < 2 ) continue; 

if ( rptCnt == 2 &S matchLen « 1 M thislnstr. diffCnt «= 1 ) continue; 



// trim last repeat off of a RZRO if its diffpart is all zeros 
// ?? if ( repeatedZeros »» ) 

// match a dist, mien, for rptCnt 



// did we stop with a good backup? 
if ( backup U backupKptCnt » 1 ) 
20 thistnstr. backup b backip; 



// determine 1) coverage 
// 2) cost 

// 3> overhead 



thislnstr. coverage « distance * (rptCnt - 1) ♦ matchLen; 
thislnstr. ptr • ptr; 

thislnstr. end » ptr ♦ thislnstr. coverage; « 

30 if ( thislnstr. backup ) C 

matchLen thislnstr. backup; 

thislnstr. diffCnt -= thislnstr. backup; 

thislnstr. ptr -e thislnstr. backup; 

thislnstr. coverage-*^" thislnstr. backMp; 
35 ^ repeatedZeros ♦» CountZeroB>rtes < thislnstr. ptr, thislnstr. backup ); 

if ( repeatedZeros == matchLen ) < 
if ( thislnstr. diffCnt sa 0 ) < 
40 thislnstr. opcode = kZero; 

thislnstr. cost a 0; 

> 

else { 

thislnstr. opcode = kRepeatZero; 
45 thislnstr. cost = (rptCnt - 1) • thislnstr. diffCnt; 

} 

else { 

thislnstr. cost a natchLen ♦ (rptCnt - 1) * thislnstr. diffcnt; 
50 ^ thislnstr. opcode « kRepeatBlock; 

thislnstr. overhead = (matchLen la a> ♦ (thislnstr. diffCnt la 0> ♦ 1; 

if ( thislnstr. cost == 0 ) 
55 thislnstr. gain a thislnstr. coverage * ( 78 - thislnstr. overhead ): 

else 

thislnstr. gain a ( thislnstr. coverage • 100 > / ( thislnstr. cost ♦ 
thislnstr. overhead ); 

60 // thislnstr. gain -a thislnstr. overhead; 

//thislnstr. gain a thislnstr. coverage - thislnstr. cost - thislnstr. overhead; 



65 



// done later : 

// if < thislnstr. gain < 122 ) continue; 



wo 94/22078 



PCTAJS94/02952 



10 



25 



30 



35. 



40 



45 



- 109 - 

// these next 5 are really part of the If 
// but hear because they are useful for 
// the debugging in the else part 
thislnstr. matchLen = matchLen; 
thislnstr. rptCnt = rptCnt; 

!f ( thislnstr. gain >& 104 && 

( ! found }{ thislnstr. gain > bestlnstr •> gain > ) < 
found 8 TRUE; 



*bestlnstr ■ thislnstr; 

if ( thislnstr. diffcnt « 0 U 

thislnstr. coverage >« kMaxForwarcMatch > break; 

#if DEBUG 
15 if ( debug > 

EinitOnelnst ( '*' , i thislnstr }; 

«endif 

> 

20 «if DEBUG 

if ( debug ) 

EnitOnelnst ( ft thislnstr ); 

«endif 

> 



if ( tryShorter ) ( 
inatchLen = matchCount; 
goto Shorter; 

} 



return fomd; 
> /• Hatches (> V 



OSErr PaclcData ( register byte *pt''» register byte •end, b/te' *target, byte ♦•tend ) 
OSErr err; 

targeteuf o target; 
targetEnd ■ •tend; 

gBlockStart ■ ptr; 

buflnst. opcode " kNoOpcode; 
buflnst. end » 0; 



while ( ptr < end ) < 
50 If < Hatches ( ptr^ end, ft newlnst ) ) < 

if ( newlnst. backup } C 

buflnst. rptCnt -= newlnst. backup; 
. if ( buflnst. rptCnt = 0 > 

55 ^ buflnst. opcode = kNoOpcode; 

if ( ptr >= buflnst. end ) { 
err ■ Ecnitlnst ( ft buflnst ); 
oO if ( err ) return err; 

> 

buflnst s newlnst; 

// for now, advance 
o5 ptr s newlnst. end; 
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continue; 

> 

else ( 

5 If ( buflnst. opcode bs kBlock ) < 

buflnst. diffCnt**; 
buflnst. end**; 
buflnst. cost<M>; 
. buflnst. coverage**; 

10 // buflnst. gain**; 

) 

else < 

err = Emitlnst ( ft buflnst ); 
if ( err ) return err; 

clearbytes ( (byte *) & buflnst, sizeof buflnst ); 

buflnst. opcode • kBlock; 

buflnst. diffCnt s 1; 

buflnst. rptCnt = 1; 
20 buflnst. ptr = ptr; 

buflnst. end » ptr*1; 

buflnst. coverage = 1; 

buflnst. cost - 1; 

buflnst. overhead 1; 
25 buflnst. gain « ^oo - 1; 



30 



> 

ptr**; 

Enitlnst ( & buflnst ); 
*tend » targetBuf ; 



35 return noErr; 

} /* PackData () */ 



pascal void BlockClear ( void *s, long I ) 
40 { 

clearbytes < (byrte •) s, (long) I ); 
> /• BlockClear () */ 



45 long GetCount ( register byte *ptr, byte **nxt ) 
( 

register long value = *ptr*+; 
register byte nextB; 

50 if ( value I 0x80 ) < 

nextB 8 *ptr**; 
value « (value ft 0x7f ) « 7; 
ff ( nextB ft 0x80 ) { 
value *» nextB ft Ox7f ; 
55 nextB = *ptr**; 

value «= 7; 
if ( nextB ft 0x80 ) < 
value *s nextB ft 0x7f; 
nextB a *ptr**; 
60 value «s 7; 

ii ( nextB & 0x80 } < 
value *= nextB ft 0x7f; 
nextB s *ptr**; 
value «e 7; 

65 y 
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value ♦» nextB; 

) 

*nxt = ptr; 
return value; 
> /• GetCount () */ 
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OSErr UnpackData ( register byte *ptr. register byte *end, register b/te •trg, byte *tend ) 

byte *startPtr = ptr; 
register byte ul; 
register long cnt; 
register long cntX; 
register long rep; 
register long dif; 
register long offset; 
byte *nxt; 

#if DEBUG 

prfntf ("ptr = Xx«m end = 3U\n", ptr, end ); 
#endif 

while ( ptr < end ) { 
ul = *ptr++; 

if ( ul 4 kFirstOperancWask ) cnt ul & kFirstOperandHask; 
else { 

cnt = GetCount ( ptr, & nxt ); 
ptr - nxt; 

// ue know the count ts larger than kFirstOperamMask (about 31) 
switch ( ul » kOpcodeShift > < 
case kZero : 



#if DEBUG 
#endif 



«if DEBUG 
#endif 



case kBlock 



printf ( " ZRQx 53d\n", cnt ); 

if ( trg*cnt > tend > return -1; 
BlockClear < trg, cnt ); 
trg ♦= cnt; 
continue; 



printf ( " BLKX 53d\n*, cnt ); 

if < trg+cnt > tend > return -1; 
BtockMove ( ptr, trg, cnt >; 
ptr +8 cnt; 
trg +s cnt; 
continue; 



55 



60 



65 



switch ( u1 » kOpcodeShift > ( 
default : 



kOpcodeShift ); 



case kZero 
«if DEBUG 

#endif 



prfntf ( »# Xs: Unknown opcode Xd\n», gToolName, ul » 
goto EBOT; 

printf ( 2110 »d\n", cnt ); 

if < trg+cnt > tend ) goto EBOT; 
while ( --cnt >s o ) < *tr9** s O; ) 
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case kBlocIc 
#if DEBUG 

#endif 



case kRepeat 



#if DEBUG 
#end1f 



case kftepeatZero 



#if DEBUG 
tfendif 



LI : 



break; 

printf ( " BLK X3d\n», cnt ); 

if ( trg+cnt > tend ) goto EBOT; 

while ( --cnt >= 0 ) t •trg** = *ptr-M-; > 

break; 



rep B GetCount ( ptr, & nxt > « 1; 
ptr s nxt; 

printf ( » RPT X3d,X3d\n". cnt, rep >; 

if { cnt == 1 ) { 

// repeat one byte over and over, 

// coninon enough to test for and optimize 

u1 B *ptr+*; 

if ( trg^rep > tend ) goto EBOT; 
while < -rep >= 0 > < 
•trg** « ul; 

y 

break; 

) 

while ( --rep >= 0 ) { 
ptr = nxt; 
cntX « cnt; 

if ( trs+cntX > tend ) goto EBOT; 
while ( "cntx >s 0 ) < 
*trg*+ a *ptr^; 

> 

break; 



dif = GetCount ( ptr, & nxt ); 
ptr = nxt; 

rep = GetCount ( ptr, & nxt }; 
ptr B nxt; 

printf ( » RPT2 X3d,S3d,X3d\n", cnt, dif, rep*-l ); 

// NOTE: rep counts the # of dif parts 
// which equals the repeated parts - 1 
// so, we don't rep--; 

offset B 0; 

// do repeated part first 
goto LI; 

while ( -rep >= 0 ) ( 
// do diff part 
ptr ♦= offset; 
cntx = dif; 

if ( trg+cntX > tend ) goto EBOT; 
while ( --cntx >= 0 ) < 
*tr9^ = 'ptr-M-; 

> 

offset += dif; 

// do repeated part 
ptr = nxt; 
cntx s cnt; 



wo 94/22078 



PCT/US94/02952 



- 113 - 

If < trg+cntX > tend > goto EBOT; 
while ( "cntX >s 0 ) < 
•trg** B 0; 

• > ' 

ptr ♦= offset; 
break; 

case kRepeatBloelc : dif = GetCount ( ptr, ft nxt ); 
10 ptr = nxt; 

rep = GetCount ( ptr, & nxt ); 
ptr = nxt; 

#if DEBUG 

- _ printf ( " RPTB SBd,X5d,X3d\r»". cnt. dif, repH ); 
lb #endif 

// NOTE: rep counts the # of dif parts 
// which equals the repeated parts • 1 
// so, ue don't rep--; 

20 offset ~ 0; 

// do repeated part first 
goto L2; 

^_ while ( --rep 0 ) { 

25 // do diff part 

ptr +s offset; 

cntX = dif; 

if ( trg+cntx > tend ) goto EBOT; 
while ( --cntX >= 0 ) { 
30 *trg^+ = *ptr**; 

> 

offset *^ dif; 

- L2 : // do repeated part 
J 5 ptr s nxt; 

cntX s cnt; 

if < trg+cntx > tend ) goto EBOT; 
while ( --cntX >= 0 > { 
-trg** ■ *ptr+*; 

40 } 

> 

ptr offset; 
break; 

45 > 

J 

return noErr; 
50 EBOT : 

prfntf ( «# Xs: Unpack Error a Xd\n", gloolName, --ptr - startPtr ); 



55 



return -1; 
> /*'UnpackData (> V 

byte *P8rtialMaine ( byte •name ) 
{ 

byte *end; 



60 for ( end s name ♦ strlen ( (char *) nane ); • 

end > name; 
end-- ) { 



65 



If ( end t-11 « ,'} end t-1J « •/* ) break; 
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return end; 
J /• PartiaUame () «/ 
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CLAIMS 

1. A method for loading a relocatable file into a 
computer system memory for execution, said file having 
a plurality of information items to be loaded into said 
memory, certain ones of said information items having a 
respective first address field containing a relocatable 
address, said file further having a plurality of 
relocation instructions, comprising the steps of: 

retrieving one of said relocation instructions; and 
updating the address field of n consecutive ones of 
said information items in response to said one of said 
relocation instructions, n being specified in said one 
of said relocation instructions. 

2. A method according to claim l, further 
comprising the step of repeating said steps of 
retrieving and updating for different ones of said 
plurality of relocation instructions. 

3. A method according to claim 2, wherein said 
relocation instructions are of different types, said 
step of updating comprising steps which depend upon the 
type of each respective one of said relocation 
instructions. 

4. A method according to claim 2, wherein each of 
said relocation instructions is located at a respective 
address, wherein the address of said relocation 
instruction is determined from a first pointer (reloc) 
for each repetition of said step of retrieving, further 
comprising, in conjunction with each repetition of said 
step of retrieving, the step of updating said first 
pointer (reloc) to a value dependent upon the length of 
said retrieved relocation instruction. 

5 . A method according to claim 4 , wherein said step 
of updating said first pointer (reloc) comprises the 
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step of updating said first pointer (reloc) to point to 
the one of said relocation instructions which 
immediately follows said retrieved relocation 
instruction. 

6. A method according to claim 4, wherein said step 
of updating said first pointer (reloc) comprises the 
step of adding to said first pointer (reloc) the nximber 
of bytes in said retrieved relocation instruction. 

7. A method according to claim 2, wherein the 
address in said memory of the first of said n 
consecutive information items is determined from a 
second pointer (rAddr) for each repetition of said step 
of updating, further comprising, in conjunction with 
each repetition of said step of updating, the step of 
updating said second pointer (rAddr) to a value 
dependent upon n. 

8. A method according to claim 7, wherein said step 
of updating said second pointer (rAddr) comprises the 
step of updating said second pointer (rAddr) to point to 
the first address in said memory which follows said n 
consecutive information items. 

9. A method according to claim 7, wherein said step 
of updating said second pointer (rAddr) comprises the 
step of adding to said second pointer (rAddr) , n times 
a predetermined number of bytes per information item. 

10. A method according to claim l, wherein said step 
of updating the first address field comprises the step 
of updating the first address field of each given 
information item in said n consecutive information items 
to a value which depends upon a respective given element 
of a table (imports) . 
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11. A method according to claim 10, wherein said 
step of updating the first address field of each given 
information item comprises the steps of: 

updating the first address field of an i'th one of 
said n consecutive information items to a value which 
depends upon an i'th element of said table, i being 
determined from a count value (rSyml) ; 

incrementing said count value (rSyml) in conjunction 
with each repetition of said step of updating the first 
address field of an i'th one of said n consecutive 
information items; and 

repeating, for each of said n consecutive information 
items, said step of updating the first address field of 
an i'th one of said n consecutive information items and 
said step of incrementing said count value (rSyml) . 

12. A method according to claim ll, wherein said 
step of updating the first address field of an i'th one 
of said n consecutive information items coitprises the 
step of adding the contents of said i'th element of said 
table to said first address field of said i'th one of 
said n consecutive information items. 

13. A method according to claim 1, wherein said 
step of updating the first address field comprises the 
step of updating the first address field of each given 
information item in said n consecutive information items 
to a value which depends upon. a third pointer (codeA, 
dataA) , the value in said third pointer (codeA, dataA) 
remaining constant for said entire step of updating the 
first address field of n consecutive ones of said 
information items. 

14. A method according to claim 13, wherein said 
step of updating the first address fields of each given 
information item comprises the step of adding the 
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contents of said third pointer (codeA, dataA) to said 
first address field of said given information item. 

15. A method according to claim 13, further 
comprising the step of selecting said third pointer 
(codeA, dataA) in dependence upon said relocation 
instruction (e.g. CODE, DATA). 

16, A method for loading a relocatable file into a 
computer system memory for execution, said file having 
a plurality of information items to be loaded into said 
memory, certain ones of said information items having a 
respective first address field containing a relocatable 
address, said file further having a plurality of 
relocation instructions, said relocation instructions 
being of different types, comprising the steps of: 

retrieving one of said relocation instructions from 
a location dependent upon a first pointer (reloc) ; 

updating said first pointer (reloc) in conjunction 
with each repetition of said step of retrieving; 

performing, if said retrieved relocation instruction 
is in a first class of relocation instructions (e.g. 
DDAT, CODE, DATA, DESC, DSC2, VTBL, SYMR, SYMB, SECN, 
LSECN) , a relocation operation on at least one of said 
information items, the first of said at least one of 
said information items being designated by a second 
pointer (rAddr) , said relocation operation including 
steps which depend upon the type of said, retrieved 
relocation instruction; 

updating said second pointer (rAddr) in conjunction 
with each performance of one of said relocation 
operations on said at least one of said information 
items, and 
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repeating said steps of retrieving, updating said 
first pointer (reloc) , performing, and updating said 
second pointer (rAddr) . 

17. A method according to claim 16, wherein said 
f retrieved relocation instruction is in a second class of 

relocation instructions {e.g. DDAT, CODE, DATA, DESC, 
DSC2, VTBL, SYMR) Which is within said first class, and 
wherein said steps of said relocation operation coxnprise 
the step of updating the first address field of n 
consecutive ones of said information items, the 
beginning of said n consecutive ones of said information 
items being designated by said second pointer (rAddr) , 
n being specified in said retrieved relocation 
instruction. 

18. A method according to claim 17, wherein said 
retrieved relocation instruction is in a third class of 
relocation instructions (e.g. DDAT, CODE, DATA, DESC, 
DSC2, VTBL) which is within said second class, and 
wherein said step of updating the first address field 
comprises the step of updating the first address field 
of each given information item in said n consecutive 
ones of said information items to a value which depends 
upon the prior contents of such first address field and 
upon a third pointer (codeA, dataA) , the value in said 
third pointer (codeA, dataA) remaining constant for said 
entire step of updating the first address field of n 
consecutive information items. • 

19. A method according to claim 17, wherein said 
retrieved relocation instruction is in a fourth class of 
instructions (e.g. DDAT) within said third class, and 
wherein said step of updating the first address field 
comprises the step of adding the contents of a third 
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pointer (dataA) to said first address field of each of 
said n consecutive information items, 

further comprising the step, if said retrieved 
relocation instruction is in said fourth class of 
instructions (e.g. DDAT) , incrementing said second 
pointer (rAddr) by a value specified in said retrieved 
relocation instruction prior to said step of performing 
a relocation operation. 

20. A method according to claim 17, wherein said 
retrieved relocation instruction is in a fifth class of 
relocation instructions {e.g. CODE, DATA) within said 
second class, said fifth class including a sixth class 
of relocation instructions (e.g. CODE) and a seventh 
class of relocation instructions (e.g. DATA) , and 
wherein said step of updating the first address field 
comprises the steps of: 

adding the contents of a third pointer (codeA) to 
said first address field of each of said n consecutive 
information items if said retrieved relocation 
instruction is in said sixth class of instructions (e.g. 
CODE) ; and 

adding the contents of a fourth pointer (dataA) to 
said first address field of each of said n consecutive 
information items if said retrieved relocation 
instruction is in said seventh class of instructions 
(e.g. DATA) . 

21. A method according to claim 17, wherein said 
retrieved relocation instruction is in an eighth class 
of instructions (e.g. VTBL) within said second class, 

wherein said .step of updating the first address 
field comprises the step of adding the contents of a 
fourth pointer (dataA) to the first word of each of said 
n consecutive information items, each of said n 
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consecutive information items having a length of two 
words , 

and wherein said step of updating said second 
pointer (rAddr) comprises the step of incrementing said 
second pointer by 2n words. 

22. A method according to claim 17, wherein said 
retrieved relocation instruction is in a ninth class of 
relocation instructions (e.g. DESC, DSC2) within said 
second class, 

wherein said step of updating the first address 
field comprises the step of adding the contents of a 
third pointer (codeA) to said first address field of 
each of said n consecutive information items, 

wherein each of said n consecutive information items 
further has a second address field, 

said steps of said relocation operation further 
comprising the step of adding the contents of a fourth 
pointer (dataA) to said second address field of each of 
said n consecutive information items. 

23. A method according to claim 17, wherein said 
retrieved relocation instruction is in a tenth class of 
instructions (e.g. SYMR) , and wherein said step of 
updating the first address field comprises the steps of: 

adding, to the first address field of each i'th one 
of said n consecutive information items, the contents of 
a respective i'th element of n consecutive elements in 
a table (imports) , the beginning of said n consecutive 
elements in said table being determined from a count 
value (rSyml) ; and 

adding n to said count value (rSyml) . 

24. A method according to claim 16, wherein said 
retrieved relocation instruction is in am eleventh class 
of relocation instructions (e.g. SYMB, SECN, hSECN) 
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within said first class, and wherein said steps of said 
relocation operation comprise the step of updating the 
first address field of the information item designated 
by said second pointer (rAddr) . 

25. A method according to claim 24, wherein said 
step of updating the first address field comprises the 
step of adding to said first address field the contents 
of a pointer { imports [ IDX9 ] , regions [ IDX9 ] , 
regions [CNT22 3 ) specified by said retrieved relocation 
instruction. 

26. A method according to claim 25, wherein said 
retrieved relocation instruction is in a twelfth class 
instructions (e.g. SYMB) within said eleventh class, 
wherein said pointer specified by said retrieved 
relocation instruction is given as the i'th element of 
a table (imports) , i being specified in said retrieved 
relocation instruction, said method further conqprising 
the step of maintaining a count value (rSyml) indicating 
the next element of said table (imports) to use, said 
steps of said relocation operation further comprising 
the step of storing i + l as said count value (rSyml) 
for subsequent use. 

27. A method according to claim 16, further 
comprising the step of, if said retrieved relocation 
instruction is in a thirteenth class of instructions 
(e.g. GDIS, DTIS, LCDIS, LDTIS) , updating the contents 
of said third pointer (codeA, dataA) to a value which 
depends on said retrieved relocation instruction. 

28. A method according to claim 27, wherein said 
plurality of information items is to be loaded .into a 
plurality of regions of said memory, each of said 
regions containing contiguously a respective group of at 
least one of said plurality of information items, each 
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Of said regions having a respective designation and a 
respective start address, 

and wherein said step of updating the contents of 
said third pointer (codeA, dataA) comprises the step of 
storing as said third pointer {codeA, dataA) the start 
address of one of said regions whose designation is 
specified in said retrieved relocation instruction. 

29. A method according to claim 16, further 
comprising the step of, if said retrieved relocation 
instruction is in a fourteenth class of instructions 
(e.g. DELTA), adding a value to said second pointer 
(rAddr), which value is specified in said retrieved 
relocation instruction. 

30. A method according to claim 16, further 
comprising the step of, if said retrieved relocation 
instruction is in a fifteenth class of instructions 
(e.g. LABS) , storing a value specified in said retrieved 
relocation instruction as said second pointer (rAddr) . 

31. A method according to claim 16, further 
comprising the step of, if said retrieved relocation 
instruction is in a sixteenth class of instructions 
(e.g. RPT, LRPT) , the step of updating the contents of 
said first pointer (reloc) to a value which depends upon 
said retrieved relocation instruction. 

32. A method according to claim 31, wherein said 
step of updating the contents of said first pointer 
(reloc) comprises the step of subtracting a value j from 
said contents of said first pointer (reloc), j being 
specified in said retrieved relocation instruction. 

33. A method according to claim 32, further 
comprising the steps of: 

maintaining a count of the number of repetitions of 
said step of subtracting; and 
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inhibiting said step of subtracting after m of such 
repetitions, m being specified in said retrieved 
relocation instruction. 

34. A method according to claim 23, further 
comprising the step of, if said retrieved relocation 
instruction is in an seventeenth class of instructions 
(e.g. LSYM) , updating said count value (rSyml) to a 
value which depends upon said retrieved relocation 
instruction. 

35. A method according to claim 26, further 
comprising the step of, if said retrieved relocation 
instruction is in an seventeenth class of instructions 
(e.g. LSYM) , updating said count value (rSyml) to a 
value which depends upon said retrieved relocation 
instruction. 

36. An information storage medium carrying a 
relocatable file for loading into a computer system 
memory, said file having a plurality of information 
items to be loaded into said memory, certain ones of 
said information items having a respective first address 
field containing a relocatable address, said file 
further having a plurality of relocation instructions, 
including a first instruction which specifies updating 
of the first address field of n consecutive ones of said 
information items, n being specified in said first 
relocation instruction. 

37. A medium according to claim 36, wherein said 
relocation instructions further include a second one of 
said relocation instructions, said second relocation 
instruction specifying updating of an address field of 
m consecutive ones of said information items, m being 
specified in said second relocation instruction. 
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38. A medium according to claim 37, wherein said 
first and second relocation instructions are of 
respective different first and second types, and wherein 
the updating specified by relocation instructions of 
said first type is different from the updating specified 
by relocation instructions of said second type. 

39. A medium according to claim 37, wherein said 
first and second relocation instructions have different 
byte lengths. 

40. A medium according to claim 36, wherein said 
first relocation instruction does not specify which of 
said n consecutive information items are to have an 
address field updated. 

41. A medium according to claim 36, wherein said 
updating of the first address field of said n 
consecutive information items as specified by said first 
relocation instruction includes updating the first 
address field of each given information item in said n 
consecutive information items to a value which depends 
upon a respective given element of a table (imports) . 

42. A medivim according to claim 41, wherein said 
updating of the first address field of a given 
information item as specified by said first relocation 
instruction includes adding the contents of said 
respective given element of said table to said first 
address field of said given information item. 

43. A medium according to claim 36, wherein said 
updating the first address field as specified by said 
first relocation instruction includes updating the first 
address field of each given information item in said n 
consecutive information items to a value which depends 
upon a third pointer (codeA, dataA) , the value in said 
third pointer (codeA, datcA) remaining constant for said 



wo 94/22078 



PCT/US94/02952 



- 126 - 

entire updating of the first address field of n 
consecutive ones of said information items. 

44. A medium according to claim 43, wherein said 
updating of the first address fields of each given 
information item as specified by said first relocation 
instruction includes adding of the contents of said 
third pointer (codeA, dataA) to said first address field 
of said given information item; 

45. A medium according to claim 43, wherein said 
updating the first address field as specified by said 
first relocation instruction further includes a 
specification of which of a predetermined plurality of 
pointers constitutes said third pointer. 

46. An information storage medium carrying a 
relocatable file for loading into a computer system 
memory, said file having a plurality of information 
items to be loaded into said memory, certain ones of 
said information items having a respective first address 
field containing a relocatable address, said file 
further having a plurality of relocation instructions, 
said relocation instructions being of different types, 
a first class of said relocation instructions (e.g. 
DDAT, CODE, DATA, DESC, DSC2, VTBL, SYMR, SYMB, SECN, 
liSECN) specifying a relocation operation on at least one 
of said information items, said at least one of said 
information items not being designated, in said 
relocation instructions in said first class of 
relocation instructions, the type of said relocation 
instruction identifying the relocation operation to be 
performed on said at least one of said information 
items. 

47. A medium according to claim 46, wherein the 
relocation operation identified by each given relocation 
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instruction in a second class of relocation instructions 
{e.g. DDAT, CODE, DATA, DESC, DSC2 , VTBL, SYMR) which is 
within said first class, includes updating the first 
address field of n consecutive ones of said information 
items, said n consecutive ones of said information items 
not being designated in said given relocation 
instruction, n being specified in said given relocation 
instruction. 

48. A medium according to claim 47, wherein the 
updating to be performed if said given relocation 
instruction is in a third class of relocation 
instructions {e.g. DDAT, CODE, DATA, DESC, DSC2, VTBL) 
which is within said second class, includes updating the 
first address field of each given information item in 
said n consecutive ones of said information items to a 
value which depends upon the prior contents of such 
first address field and upon a third pointer {codeA, 
dataA) , the value in said third pointer (codeA, dataA) 
remaining constant for said entire updating of the first 
address field of n consecutive information items. 

49. A medium according to claim 47, wherein the 
updating to be performed if said given relocation 
instruction is in a fourth class of instructions {e.g. 
DDAT) within said third class, includes adding of the 
contents of a third pointer {dataA) to said first 
address field of each of said n consecutive .information 
items, 

said given relocation instruction further specifying 
a value by which a second pointer {rAddr) pointing to 
said information items is to be incremented prior to 
said updating. 

50. A medium according to claim 47, wherein the 
updating to be performed includes adding the contents of 



wo 94/22078 PCT/US94/02952 



- 128 - 

a third pointer (codeA) to said first address field of 
each of said n consecutive information items if said 
given relocation instruction is in a sixth class, of 
relocation instructions (e.g. CODE) which is within said 
second class, and adding the contents of a fourth 
pointer (dataA) to said first address field of each of 
said n consecutive information items if said given 
relocation instruction is in a seventh class of 
instructions (e.g. DATA) which is within said second 
class. 

51. A medium according to claim 47, wherein each of 
said n consecutive information items includes a first 
word and a second word, wherein said updating to be 
performed if said given relocation instruction is in an 
eighth class of instructions (e.g. VTBL) within said 
second class, includes adding the contents of a fourth 
pointer (dataA) to the first word of each of said n 
consecutive information items, the relocation operation 
identified by relocation instructions in said eighth 
class of instructions not including any updating of the 
second word of each of said n consecutive information 
items. 

52. A medium according to claim 47, wherein each of 
said n consecutive information items further includes a 
respective second address field containing a relocatable 
address, wherein the relocation operation identified by 
each given relocation instruction in a ninth class of 
relocation instructions (e.g. DESC, DSC2) within said 
second class, includes adding the contents of a third 
pointer (codeA) to said first address field of each of 
said n consecutive information items and adding the 
contents of a fourth pointer (dataA) to said second 
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address field of each of said n consecutive information 
items . 

53. A medium according to claim 47, wherein the 
updating to be performed if said given relocation 
instruction is in a tenth class of instructions (e.g. 
SYMR) , includes adding, to the first address field of 
each i'th one of said n consecutive information items, 
the contents of a respective i'th element of n 
consecutive elements in a table (inports) . 

54. A medium according to claim 46, wherein said 
relocation operation specified by each given relocation 
instruction which is in an eleventh class of relocation 
instructions (e.g. SYMB, SECN, LSECN) within said first 
class, includes updating the first address field of the 
information item designated by a second pointer (rAddr) 
the contents of which are not specified in said 
relocation instruction. 

55. A medium according to claim 54, wherein said 
updating of the first address field includes adding to 
said first address field the contents of a pointer 
(imports [IDX9] , regions [IDX9] , regions [CNT22] ) specified 
by said given relocation instruction. 

56. A medium according to claim 46, wherein said 
plurality of relocation instructions further includes 
relocation instructions in a thirteenth class of 
instructions (e.g. DCIS, DTIS, LCDIS, LDTIS) , each 
relocation instruction in said thirteenth class of 
instructions specifying that a third one of a 
predetermined plurality of pointers (codeA, dataA) is to 
be updated, the type of each relocation instruction in 
said thirteenth class of instructions determining which 
of said predetermined plurality of pointers (codeA, 
dataA) is to be said third pointer. 
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57. A mediiim according to claim 56, wherein said 
plurality of information items is to be loaded into a 
plurality of regions of said memory, each of said 
regions containing contiguously a respective group of at 
least one of said plurality of information items, each 
of said regions having a respective designation and a 
respective start address, 

and wherein the updating specified by each given 
relocation instruction in said thirteenth class of 
instructions includes storing as said third pointer 
(codeA, dataA) the start address of one of said regions 
whose designation is specified in said given relocation 
instruction. 

58. A medium according to claim 46, wherein said 
relocation instructions further include a . fourteenth 
class of instructions (e.g. DELTA), each relocation 
instruction in said fourteenth class of instructions 
specifying a value to be added to a second pointer 
(rAddr) into said information items. 

59. A medium according to claim 46, wherein said 
relocation instructions further include a fifteenth 
class of instructions (e.g. LABS) , each relocation 
instruction in said fifteenth class of instructions 
specifying a value to be stored as a second pointer 
< rAddr) into said information items. 

60. A medium according to claim 46, wherein said 
relocation instructions further include a sixteenth 
class of instructions (e.g. RPT, LRPT) , each instruction 
in said sixteenth class of instructions specifying a 
number of times for which an immediately preceding group 
of m of said relocation instructions are to be repeated. 
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61. A znediiim according to claim 60, wherein each 
instruction in said sixteenth class of instructions 
further specifies m. 

62 . An information storage medium carrying a file 
for loading into a computer system memory, said file 
comprising: 

an input portion associating a plurality of import 
symbol names with respective inport symbol numbers; and 

an export portion associating a plurality of export 
symbol names with respective export symbol location 
values, said export portion being distinct from said 
import portion. 

63. A medixim according to claim 62, wherein said 
import portion comprises an import symbol table having 
an import table entry corresponding to each of said 
import symbol names, each of said import table entries 
having a respective location in said table and 
identifying the import symbol name to which it 
corresponds, the import symbol number associated with 
each of said import symbol names being derivable from 
the location of the import table entry corresponding to 
the import symbol name in said import symbol table. 

64. A medium according to claim 63, wherein said 
file further comprises a string table containing said 
import symbol names, and wherein each of said entries in 
said import symbol table includes an offset into said 
string tsUble to identify the import symbol name to which 
the entry corresponds. 

65. A medium according to claim 63, wherein the 
import symbol numbers for all of said import symbol 
names collectively constitute a series of consecutive 
integers . 
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66. A medium according to claim 62, wherein each of 
said export symbol location values is selected from the 
group consisting of {a) a pointer to a location in said 
file and (b) an import symbol niamber. 

67. A medium according to claim 62, wherein said 
export portion comprises an export symbol table having 
an export table entry corresponding to each of said 
export symbol names, each of said export table entries 
identifying the export symbol name to which it 
corresponds and the export symbol location value 
associated with the export symbol name to which it 
corresponds . 

68. A medi\im according to claim 62, wherein said 
file further comprises at least one loadable section, 
and wherein a first one of said export symbol location 
values includes a pointer to a location in a first one 
of said loadable sections. 

69. A medium according to claim 68, wherein said 
first export symbol location value further selects which 
of said at least one loadeOsle section is said first 
loadable section. 

70. A medium according to claim 62, wherein said 
import portion comprises an inport symbol table having 
an import table entry corresponding to each of said 
import symbol names, each of said import table entries 
having a respective location in said table and 
identifying the import symbol name to which it 
corresponds, the import symbol number associated with 
each of said import symbol names being derivable from 
the location of the inqport table entry corresponding to 
the import symbol name in said import symbol table, said 
import symbol table being distinct from said export 
symbol table. 
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71. A medium according to claim 70, wherein said 
file further comprises a plurality of import syxnbol 
strings located in an import symbol name string portion 
of said file, and a plurality of export symbol strings 
located in am export symbol name string portion of said 
file, 

wherein each of said import table entries designates 
one of said import symbol name strings to identify the 
inport symbol nsune to which the import table entry 
corresponds , 

wherein each of said export table entries designates 
one of said export symbol name strings to identify the 
export symbol name to which the export table entry 
corresponds , 

and wherein said import and export symbol name 
string portions of said file are distinct. 

72. A medixim according to claim 71, wherein said 
import and export symbol. name string portions are each 
distinct from said export symbol table, and wherein said 
export symbol name string portion of said file is 
located closer in said file than said import symbol name 
string portion of said file, to said export symbol 
table. 

73 . A medium according to claim 72 , wherein said 
import and export symbol name string portions are each 
distinct from said import symbol table, and wherein said 
import symbol name string portion of said file is 
located closer in said file than said export symbol name 
string portion of said file, to said import symbol 
table. 

74. A medium according to claim 67, wherein said 
export symbol teOble is hashed. 
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75. A medium according to claim 67, wherein said 
file further comprises an export slot table which 
associates each of said export symbol names with a 
predetermined subset of said export table entries . 

76. A medium according to claim 67, wherein said 
file comprises a chain entry corresponding to each of 
said export symbol names, each of said chain entries 
containing a hashword which is a first predefined 
function of the export symbol name corresponding to the 
chain entry, said chain entries being grouped into 
slots, at least one of said slots having more than one 
chain entry, each slot corresponding to a respective one 
of a plurality of hash slot values, the hash slot value 
corresponding to each given slot being a second 
predefined function of each of the export symbol names 
corresponding to a chain entry in said given slot. 

77. A medium according to claim 76, wherein said 
file further coinprises an export slot table which 
associates each given one of said hash slot values with 
the chain entries in the slot which correspond to said 
given hash slot value. 

78. A medium according to claim 62, wherein said 
file further comprises a library portion associating a 
plurality of import library names with respective sets 
of said import symbol numbers, said library portion 
being distinct from said import portion and from said 
export portion - 

79. A medium according to claim 62, wherein said 
file has a plurality of information items to be loaded 
into said memory, certain ones of said information items 
having a respective first address field containing a 
relocatable address, said file further having a 
plurality of relocation instructions, including a first 
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instruction which specifies updating of the first 
address field of n consecutive ones of said information 
items, n being specified in said first relocation 
instruction. 

80. A medium according to claim 62, said file 
having a plurality of information items to be loaded 
into said memory, certain ones of said information items 
having a respective first address field containing a 
relocatable address, said file further having a 
plurality of relocation instructions, said relocation 
instructions being of different types, a first class of 
said relocation instructions (e.g. DDAT, CODE, DATA, 
DESC, DSC2, VTBL, SYMR, SYMB, SECN, LSECN) specifying a 
relocation operation on at least one of said information 
items, said at least one of said information items not 
being designated in said relocation instructions in said 
first class of relocation instructions, the type of said 
relocation instruction identifying the relocation 
operation to be performed on said at least one of said 
information items. 

81. An information storage medium carrying a file 
for loading into a computer system memory, said file 
having an export symbol table associating a plurality of 
export symbol names with respective export symbol 
location values, said export symbol table being hashed. 

82. An information storage meditam carrying a file 
for loading into a computer system memory, said file 
having an export symbol table associating a plurality of 
export symbol names with respective export symbol 
location values, and an export slot table which 
associates each of said export symbol ncuties with a 
predetermined subset of said export table entries. 
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83. An infonnation storage medium carrying a file 
for loading into a computer system memory, said file 
having an export symbol table associating a plurality of 
export symbol names with respective export symbol 
location values, said file further comprising a chain 
entry corresponding to each of said export symbol names, 
each of said chain entries containing a hashword which 
is a first predefined function of the export symbol name 
corresponding to the chain entry, said chain entries 
being grouped into slots, at least one of said slots 
having more than one chain entry, each slot 
corresponding to a respective one of a plurality of hash 
slot values, the hash slot value corresponding to each 
given slot being a second predefined function of each of 
the export symbol names corresponding to a chain entry 
in said given slot. 

84. A medium according to claim 83, wherein said, 
file further coit^jrises an export slot table which 
associates each given one of said hash slot values with 
the chain entries in the slot which correspond to said 
given hash slot value. 

85. A method for loading a relocatable file into a 
computer system memory for execution, said file having 
a plurality of information items to be loaded into said 
memory, a first one of said information items having an 
address field and an indication of a first import symbol 
name for use in updating said address field, comprising 
the steps of: 

loading an import file into said memory, said import 
file having a table of export symbols containing a 
plurality of export symbol slots each having a slot 
value, each of said slots in said table of export 
symbols containing at least one association associating 
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an export symbol name to a respective location value, at 
least one of said slots in said tcQDle of export symbols 
containing more than one such association, all of the 
export symbol names associated in each given one of said 
slots having the same hash slot value according to a 
predefined hash- slot -value algorithms- 
determining a first hash slot value for said first 
import symbol name according to said predefined hash- 
slot -value algorithm; 

searching for a first export symbol name in a slot 
corresponding to said first hash slot value in said 
table of export symbols in said import file, which 
export symbol name matches said first import symbol 
neune ; and 

updating said address field of said first 
information item in response to the location value 
associated by said table of export symbols to said first 
export symbol name. 

86. A method according to claim 85, wherein each of 
said associations in said table of export syitODOls in 
said import file further associates a hashword value 
computed from the export symbol name of the association 
according to a predefined hash -word -value algorithm, 
said method further comprising the step of determining 
a first import hashword value for said first import 
symbol name according to said hash- word -value algorithm, 
and wherein said step of searching comprises the step 
of: 

determining for successive current ones of said 
associations in said slot corresponding to said first 
hash slot value in said table of export symbols, until 
the export symbol name of said current association 
matches said first import symbol name, first whether the 



wo 94/22078 PCTAJS94/02952 



- 138 - 

eaqport hashword value of said current association 
, matches said first import hashword value, and only if 
so, whether the export symbol name of said current 
association matches said first import symbol name. 

87. A method according to claim 86, wherein said 
import file further has an export slot table associating 
each given slot in said table of export symbols with the 
slot value of said given slot, said method further 
comprising the step of examing said export slot table to 
determine said slot corresponding to said first hash 
slot value. 

88. A method according to claim 85, wherein said 
import file further has an export slot table associating 
each given slot in said tcible of export symbols with the 
slot value of said given slot, said method further 
comprising the step of examing said export slot table to 
determine said slot corresponding to said first hash 
slot value. 

89. A method for constructing a relocatsQ^le file 
for loading into a computer system memory, said file 
having a plurality of information items to be loaded 
into said memory, certain ones of said information items 
having a respective first address field containing a 
relocatable address, comprising the steps of: 

identifying n consecutive ones of said certain 
information items; and 

writing to said file a first relocation instruction 
which specifies updating of the first address field of 
said n consecutive information items, n being specified 
in said first relocation instruction. 

90. A method according to claim 89, further 
comprising the steps of: 
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identifying m consecutive ones of said certain 
information items; and 

writing to said file a second relocation instruction 
which specifies updating of an address field of said m 
consecutive information items, m being specified in said 
second relocation instruction. 

91. A method according to claim 90, wherein said 
first and second relocation instructions are of 
respective different first and second types, and wherein 
the updating specified by relocation instructions of 
said first type is different from the updating specified 
by relocation instructions of said second type. 

92. A method according to claim 90, wherein said 
first and second relocation instructions have different 
byte lengths. 

93. A method according to claim 89, wherein said 
first relocation instruction does not specify which of 
said n consecutive information items are to have an 
address field updated. 

94. A method according to claim 89, wherein said 
updating of the first address field of said n 
consecutive information items as specified by said first 
relocation instruction includes updating the first 
address field of each given information item in said n 
consecutive information items to a value which depends 
upon a respective given element of a table (imports) . 

95. A method according to claim 94, wherein said 
updating of the first address field of a given 
information item as specified by said first relocation 
instruction includes adding the contents of said 
respective given element of said table to said first 
address field of said given information item. 
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96. A method according to claim 89, wherein said 
updating the first address field as specified by said 
first relocation instruction includes updating the first 
address field of each given information item in said n 
consecutive information items to a value which depends 
upon a third pointer (codeA, dataA) , the value in said 
third pointer (codeA, dataA) remaining constant for said 
entire updating of the first address field of n 
consecutive ones of said information items. 

97. A method according to claim 96, wherein said 
updating of the first address fields of each given 
information item as specified by said first relocation 
instruction includes adding of the contents of said 
third pointer (codeA, dataA) to said first address field 
of said given information item. 

98. A method according to claim 96, wherein said 
updating the first address field as specified by said 
first relocation instruction further includes a 
specification of which of a predetermined plurality of 
pointers constitutes said third pointer. 

99. A method for constructing a relocatable file 
for loading into a computer system memory, said file 
having a plurality of information items to be loaded 
into said memory, certain ones of said information items 
having a respective first address field containing a 
relocatable address, comprising the steps of; 

identifying a first plurality of said certain 
information items; and 

writing to said file a plurality of relocation 
instructions, said relocation instructions being of 
different types, a first class of said relocation 
instructions (e.g. DDAT, CODE, DATA, DESC, DSC2, VTBL, 
SYMR, SYMB, SECN, LSECN) specifying a relocation 
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operation on at least one of said information items in 
said first plurality, said at least one of said 
information items in said first plurality not being 
designated in said relocation instructions in said first 
class of relocation instructions, the type of said 
relocation instruction identifying the relocation 
operation to be performed on said at least one of said 
information items in said first plurality. 

100. A method according to claim 99, wherein the 
relocation operation identified by each given relocation 
instruction in a second class of relocation instructions 
(e.g. DDAT, CODE, DATA, DESC, DSC2, VTBL, SYMR) which is 
within said first class, includes updating the first 
address field of n consecutive ones of said information 
items in said first plurality, said n consecutive ones 
of said information items not being designated in said 
given relocation instruction, n being specified in said 
given relocation instruction. 

101. A method according to claim 100, wherein the 
updating to be performed if said given relocation 
instruction is in a third class of relocation 
instructions {e.g. DDAT, CODE, DATA, DESC, DSC2, VTBL) 
which is within said second class, includes updating the 
first address field of each given information item in 
said n consecutive ones of said information items to a 
value which depends upon the prior contents of such 
first address field and upon a third pointer (codeA, 
dataA) , the value in said third pointer {codeA, dataA) 
remaining constant for said entire updating of the first 
address field of n consecutive information items. 

102. A method according to claim 100, wherein the 
updating to be performed if said given relocation 
instruction is in a fourth class of instructions (e.g. 
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DDAT) within said third class, includes adding of the 
contents of a third pointer (dataA) to said first 
address field of each of said n consecutive information 
items, 

said given relocation instruction further specifying 
a value by which a second pointer (rAddr) pointing to 
said information items is to be incremented prior to 
said updating. 

103. A method according to claim 100, wherein the 
updating to be performed includes adding the contents of 
a third pointer (codeA) to said first address field of 
each of said n consecutive information items if said 
given relocation instruction is in a sixth class of 
relocation instructions (e.g. CODE) which is within said 
second class, and adding the contents of a fourth 
pointer (dataiA) to said first address field of each of 
said n consecutive information items if said given 
relocation instruction is in a seventh class of 
instructions (e.g. DATA) which is within said second 
class. 

104. A method according to claim 100, wherein each 
of said n consecutive information items includes a first 
word and a second word, wherein said updating to be 
performed if said given relocation instruction is in an 
eighth class of instructions {e.g. VTBL) within said 
second class, includes adding the contents of a fourth 
pointer (dataA) to the first word of each of said n 
consecutive information items, the relocation operation 
identified by relocation instructions in said eighth 
class of instruction^ not including any updating of the 
second word of each of said n consecutive information 
items. 
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105. A method according to claim 100, wherein each 
of said n consecutive information items further includes 
a respective second address field containing a 
relocatable address, wherein the relocation operation 
identified by each given relocation instruction in a 
ninth class of relocation instructions (e.g. DESC, DSC2) 
within said second class, includes adding the contents 
of a third pointer (codeA) to said first address field 
of each of said n consecutive information items and 
adding the contents of a fourth pointer (dataA) to said 
second address field of each of said n consecutive 
information items. 

106. A method according to claim 100, wherein the 
updating to be performed if said given relocation 
instruction is in a tenth class of instructions (e.g. 
SYMR) , includes adding, to the first address field of 
each i'th one of said n consecutive information items, 
the contents of a respective i'th element of n 
consecutive elements in a table (imports) . 

107. A method according to claim 99, wherein said 
relocation operation specified by each given relocation 
instruction which is in an eleventh class of relocation 
instructions (e.g. SYMB, SECN, LSECN) within said first 
class, includes updating the first address field of the 
information item designated by a second pointer (rAddr) 
the contents of which are not specified in said 
relocation instruction. 

108. A method according to claim 107, wherein said 
updating of the first address field includes adding to 
said first address field the contents of a pointer 
(imports [IDX9] , regions [IDX9] , regions [CNT22] ) specified 
by said given relocation instruction. 
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109. A method according to claim 99, wherein said 
plurality of relocation instructions further includes 
relocation instructions in a thirteenth class of 
instructions (e.g. DCIS, DTIS, LCDIS, LDTIS) , each 
relocation instruction in said thirteenth class of 
instructions specifying that a third one of a 
predetermined plurality of pointers (codeA, dataA) is to 
be updated, the type of each relocation instruction in 
said thirteenth class of instructions determining which 
of said predetermined plurality of pointers (codeA, 
dataA) is to be said third pointer. 

110. A method according to claim 109, wherein said 
plurality of information items is to be loaded into a 
plurality of regions of said memory, each of said 
regions containing contiguously a respective group of at 
least one of said plurality of information items, each 
of said regions having a respective designation and a 
respective start address, 

and wherein the updating specified by each given 
relocation instruction in said thirteenth class of 
instructions Includes storing as said third pointer 
(codeA, dataA) the start address of one of said regions 
whose designation is specified in said given relocation 
instruction. 

111. A method according to claim 99, wherein said 
relocation instructions further include a . fourteenth 
class of instructions (e.g. DELTA), each relocation 
instruction in said fourteenth class of instructions 
specifying a value to be added to a second pointer 
(rAddr) into said information items. 

112. A method according to claim 99, wherein said 
relocation instructions further include a fifteenth 
class of instructions (e.g. LABS) , each relocation 
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instruction in said fifteenth class of instructions 
specifying a value to be stored as a second pointer 
(rAddr) into said information items. 

113. A method according to claim 99, wherein said 
• relocation instructions further include a sixteenth 

class of instructions (e.g. RPT, LRPT) , each instruction 
in said sixteenth class of instructions specifying a 
number of times for which an immediately preceding group 
of m of said relocation instructions are to be repeated. 

114. A method according to claim 113, wherein each 
instruction in said sixteenth class of instructions 
further specifies m. 

115. A method for constructing an object code file 
containing a plurality of exports capable of being 
inserted into an importing file to be loaded into a 
computer system memory, each of said exports having an 
eacport symbol name and a respective export symbol 
location value, comprising the steps of: 

writing said exports to said object code file; and 
writing to said object code file an export symbol 
table associating said export synibol names with said 
export symbol location values, said export symbol table 
being hashed. 

116. A method for constructing an object code file 
containing a plurality of exports capable of being 
imported into an importing file to be loaded into a 
computer system memory, each of said esqjorts having an 
escport symbol name and a respective export symbol 
location value, comprising the steps of: 

writing to said object code file an export symbol 
table having a plurality of entries each associating a 
respective one of said export symbol names with its 
respective export symbol location value; and 



t 
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writing to said object code file an export slot 
table which associates each of said export symbol names 
with a predetermined subset of said export table 
entries. 

117. A method for constructing an object code file • 
containing a plurality of exports capable of being 
imported into an Importing file to be loaded into a 
computer system memory, each of said e3q>orts having an 

export symbol name and a respective export symbol 
location value, comprising the steps of: 

writing to said object code file an export symbol 
table associating each of said export symbol names with 
its respective export symbol location value; 

writing to said object code file a chain entry 
corresponding to each of said export symbol names, each 
of said chain entries containing a hashword which is a 
first predefined function of the e3q>ort symbol name 
corresponding to the chain entry, said chain entries 
being grouped into slots, at least one of said slots 
having more than one chain entry, each slot 
corresponding to a respective one of a plurality of hash 
slot values, the hash slot value corresponding to each 
given slot being a second predefined function of each of 
the export symbol names corresponding to a chain entry 
in said given slot. 

118. A method according to claim 117, further 
comprising the step of writing to said object code file 
an export slot table which associates each given one of 
said hash slot values with the chain entries in the slot 
which correspond to said given hash slot value. 

119. A method for constructing a file to be loaded 

into a computer system memory, comprising the steps of: ^ 
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identifying in said file a data pattern consisting 
of RCount3 repetitions of a first subpattern, followed 
by a second subpattern of length Countl bytes, each of 
said repetitions of said first subpattern consisting of 
said second subpattern followed by DCount2 additional 
bytes of data; and 

writing to said file a data escpansion instruction 
comprising Countl, DCount2, RCountS, said first 
subpattern, and each of said additional bytes of data. 

120. An information storage medium carrying a file 
for loading into a con^uter system memory, said file 
having a plurality of data expansion instructions, 
including a first instruction which specifies the 
creation of a data pattern consisting of RCount3 
repetitions of a first subpattern, followed by a second 
subpattern of length Countl bytes, each of said 
repetitions of said first subpattern consisting of said 
second subpattern followed by DCount2 additional bytes 
of data, said first instruction identifying Countl, 
DCount2, RCounta, said first subpattern, and each of 
said additional bytes of data. 

121. A method for loading a file into a computer 
system memory, said file having a plurality of data 
expansion instructions, comprising the steps of: 

retrieving a first one of said data expansion 
instructions; and 

creating a data pattern consisting of RCount3 
repetitions of a first subpattern, followed by a second 
subpattern of length Countl bytes, each of said 
repetitions of said first subpattern consisting of said 
second subpattern followed by DCount2 additional bytes 
of data, said first instruction identifying Countl, 
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DCount2, RCount3, said first subpattern, and each of 
said additional bytes of data. 
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